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The MAILING DATE of this communication appears on the cover sheet with the correspondence address - 

Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) OR THIRTY (30) DAYS, 
WHICHEVER IS LONGER, FROM THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1.1 36(a). In no event, however, may a reply be t.mely filed 

. area - «- - — t (6) sssss^ communlcation - 

Failure to reolv within the set or extended period lor reply will, by statute, cause the appl.cat.on to become ABANDONED (35 U.S.C. § 133). 
' Any reply S i by the Office later than three months atter the mailing date of this communication, even if ..mely f.led, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

I )|3 Responsive to communication(s) filed on 24 May 2006. 

2a)D This action is FINAL. 2b)IS This action is non-final. 

3) D Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 11, 453 O.G. 213. 

Disposition of Claims 

4) E3 Claim(s) 1-55. 57-59. 61-85 is/are pending in the application. 

4a) Of the above claim(s) 4 5 8 9. 16.17.28-3 * 4142.44-55.57.58.61.62. 64-85 is/are withdrawn from 

consideration. 

5) D Claim(s) is/are allowed. 

6) EI Claim(s) 1-3 & 7. 10-15 18-27. 34-40 43. 59. and 63. is/are rejected. 

7) D Claim(s) is/are objected to. 

8) D Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) D The specification is objected to by the Examiner. 

10)D The drawing(s) filed on is/are: a)D accepted or b)D objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1 .85(a). 
Replacement drawing sheet(s) including the correction is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 

I I )□ The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-1 52. 

Priority under 35 U.S.C. § 119 

12)D Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 1 19(a)-(d) or (f). 
a)D All b)D Some * c)D None of: 

1 .□ Certified copies of the priority documents have been received. 
2.D Certified copies of the priority documents have been received in Application No. 

. • I * tl 



3D Copies of the certified copies of the priority documents have been received in this National Stage 
application from the International Bureau (PCT Rule 17.2(a)). 
* See the attached detailed Office action for a list of the certified copies not received. 
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DETAILED ACTION 

1 . Applicant's election without traverse of Group I, and species of an antibody, 
lgG1 , targeting CD20, 239D substitution, increased affinity for FcyR and no 
carbohydrate modification, filed 05/24/2006, is acknowledged. 

Claims 56 and 60 have been canceled. 
Claims 1 -55, 57-59, and 61 -85 are pending. 

Claims 4, 5, 8, 9, 16, 17, 28, 29, 30-33, 41, 42, 44-55, 57-58, 61, 62, and 64-85 
have been withdrawn from further consideration by the Examiner, under 37 C.F.R. 
1 .142(b), as being drawn to nonelected inventions. 

Claims 1-3, 6, 7, 10-15, 18-27, 34-40, 43, 59, and 63, read on an antibody, lgG1, 
targeting CD20, 239D substitution, increased affinity for FcyR and no carbohydrate 
modification, are currently under consideration. 

2. The instant application appears to be in sequence compliance for patent 
applications containing nucleotide sequence and/or amino acid sequence disclosures. 

3. Applicant's claim for domestic priority under 35 U.S.C. 1 19 (e) is acknowledged. 
However, the provisional applications USSNs 60/41 4,433 and 60/442,301 upon which 
priority is claimed fail to provide adequate support under 35 U.S.C. 1 12 for claims 3, 6, 
7, 14, 15, 26, 27, 39, 40, 43, and 63 of this application. Specifically, insufficient support 
was identified for the limitation of "position.... _239". Consequently, the claims have 
been accorded the priority of the filing date of the priority applications USSNs 
60/467,606 (05/02/2003) and 60/477,839 (06/1 2/2003). 
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Should applicant disagree with the Examiner's factual determination above, it is 
incumbent upon applicant to provide a showing that specifically supports the instant 
claim limitations. 

4. Applicant's IDS, filed 05/17/2006, is acknowledged and have been considered 
except References B20 (WO 00/61739), B21 (WO 01/29246), B27 (WO 02/30954), and 
B28 (WO 02/31 140) for which have only been considered to the extent of the English 
translation of the Abstracts. 

5. The application is required to be reviewed and all spelling, TRADEMARK, and 
like error corrected. 

Trademarks should be capitalized or accompanied by the ™ or ® symbol 
wherever they appear and be accompanied by the generic terminology. Although the 
use of trademarks is permissible in patent application, the proprietary nature of the 
trademarks should be respected and every effort made to prevent their use in any 
manner which might adversely affect their validity as trademarks. 

Appropriate correction is required. 

6. The following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

7. Claims 1 , 3, 6, 7, 1 0-1 3, 1 8-27, 34, 35, 43, and 59 are rejected under 35 
U.S.C. 112, second paragraph, as being indefinite for failing to particularly point out and 
distinctly claim the subject matter which applicant regards as the invention. 
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A) Claims 1 , 3, 6, 7, 10, 1 1 , 1 8-27, 34, 35, and 59 are indefinite in its recitation of 
"modulate" because it is ambiguous as to the direction (positive or negative) or degree 
of the effect of the said "modulate". The term "modulate" is not defined by the claims, 
the specification does not provide a standard form ascertaining the requisite degree, 
and one of ordinary skill in the art would not be reasonably apprised of the metes and 
bounds of the invention. 



B) Claim 34 is indefinite in the recitation of "effector functions" because the metes 
and bounds of the effector functions is not clear and ambiguous. For example, page 25 
of the instant specification discloses certain "effector function" include antibody 
dependent cell-medialted cytotoxicity (ADCC), antibody dependent cell-mediated 
phagocytosis (ADCP). However, it is unclear as to which "effector function" or the 
requisite structural/functional characteristic is/are intended or encompassed by the 
claimed polypeptide. It is suggested to amend the claim to recite the "effector functions" 
encompassed by the claimed binding agent. See claim 36 and/or page 25 of the 
specification for example. 



C) Claims 6, 12, 13, 18, and 43 are indefinite in the recitation of "substantially 
human" and/or "substantially mouse, substantially rat or substantially monkey" because 
the metes and bounds of the claims is unclear and ambiguous. The phrase is not 
defined by the claims, the specification does not provide standard for ascertaining the 
requisite degree, and one of ordinary skill in the art would not be reasonably apprised of 
the scope of the invention. 
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D) Claims 23-27 are indefinite in the recitation of "FcyRllla-fold: FcyRllb-fold 
ratio" because the phrase fails to particularly point out and distinctly claim the subject 
matter which applicant regards as the invention. It is unclear as to what "FcyRllla-fold: 
FcyRI lb-fold ratio" is intended or encompassed by the claimed polypeptide. The phrase 
is not defined by the claims, the specification does not provide standard for ascertaining 
the requisite degree, and one of ordinary skill in the art would not be reasonably 
apprised of the scope of the invention. 

E) Applicant is reminded that any amendment must point to a basis in the 
specification so as not to add new matter. See MPEP 714.02 and 2163.06. 

8. The following is a quotation of the first paragraph of 35 U.S.C. 1 1 2: 

The specification shall contain a written description of the invention, and of the manner and process of 
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the 
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall 
set forth the best mode contemplated by the inventor of carrying out his invention. 

9. Claims 1-3, 6, 7, 10-15, 18-27, 34-40, 43, and 59 are rejected under 35 U.S.C. 
112, first paragraph, as failing to comply with the enablement requirement. The claim(s) 
contains subject matter which was not described in the specification in such a way as to 
enable one skilled in the art to which it pertains, or with which it is most nearly 
connected, to make and/or use the invention. 

Claims 1-3, 6, 7, 10-15, 18-27, 34-40, 43, and 59 recite "a polypeptide 
comprising an Fc variant" as part of the invention. 

The specification discloses on pages 27-29 that a "a polypeptid e comprising an 
Fc variant" according to the definition of a variant polypeptide meant an Fc sequence 
that differs from a parent Fc sequence by at least one amino acid modification, e.g. from 
about one to about ten amino acid modification, or possess at least 80% homology with 
a parent polypeptide sequence. 
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The specification does not enable any person skilled in the art to which it 
pertains, or with which it is most nearly connected, to make and use the invention 
commensurate in scope with these claims. 

The specification does not provide a sufficient enabling description of the claimed 
invention. The disclosure appears to show only antibodies with certain specified amino 
acid substitutions. For example, the specification discloses engineered antibodies such 
as rituximab, alemtuzumab, and trastuzumab with amino acid substitutions in the Fc 
region (see Example 2 on pages 128-130 of the specification as-filed). The instant 
claims encompass in their breadth anv "a polypeptide comp rising an Fc variant" 
comprising with at least one amino acid substitution. 

However, there does not appear to be sufficient guidance in the specification as 
field as to how the skilled artisan would make and use the claimed "variant Fc". The 
state of the art at the time the invention was made recognized that even single amino 
acid differences can result in drastically altered function of antibodies. For example, 
Lund et al. (The Journal of Immunology 1996, 157:4963-4969. Reference C10 on IDS) 
show that even a single amino acid replacement within the CH2 domain of IgG can alter 
the glycosylation profile of an antibody therefore influence its effector functions of Fc 
receptor binding and complement activation (see entire document, particularly 
Discussion on pages 4966-4968). Further, Lazar et al. (WO 03/074679) teach that the 
determinants of antibody properties, such as stability, solubility and affinity for antigen, 
important to its functions are overlapping; thus engineering an antibody to be more 
soluble may cause a loss in affinity for its antigen (see entire document, particularly 
page 3). 

Given the extensive variation permitted by the instant claim language, the skilled 
artisan would not reasonably predict such "a polypeptide comprisin g an Fc variant" to 
have the same function as the instant claimed invention. 
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Reasonable correlation must exist between the scope of the claims and scope to 
enablement set forth. Applicant does not appear to provide guidance as to other 
polypeptide comprising an Fc variant" which meets the claimed limitation of exhibits 
altered binding to an FcyR. 

In addition, it is unpredictable if functional activity will be shared by two 
polypeptides having 80% identity over the full length of their sequences. The 
specification does not appear to provide sufficient guidance as to which residues should 
or should not be changed to preserve any particular function. Although the specification 
does provide working examples of antibodies such as antibodies with specific 
position(s) in the Fc region altered (e.g. see Figure 11-13), the variation permitted by 
the instant claim language is extensive. 

However, there does not appear to be sufficient guidance in the specification as 
filed as to how the skilled artisan would make and use the claimed "a polypeptide 
comprising an Fc variant" . The specification provides no direction or guidance 
regarding how to produce "a polypeptide comprising an Fc variant" as broadly defined 
by the claims. 

In view of the lack of guidance in the specification and in view of the discussion 
above one of skill in the art would be required to perform undue experimentation in 
order to practice the claimed invention. 

Applicant is invited to consider amending the claimed Fc vari ant/polype ptide to 
antibody and/or Immunoadhesin as disclosed on pages 22-26 of the instant 
specification to obviate this rejection. 
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10. Claims 1-3, 6, 7, 10-15, 18-27, 34-40, 43, and 59 are rejected under 35 U.S.C. 
112, first paragraph, as failing to comply with the written description requirement. The 
claims contain subject matter which was not described in the specification in such a way 
as to reasonably convey to one skilled in the relevant art that the inventor(s), at the time 
the application was filed, had possession of the claimed invention. 

The following written description rejection is set forth herein. 

Claims 1-3, 6, 7, 10-15, 18-27, 34-40, 43, and 59 recite "a polypeptide 
comprising an Fc variant" as part of the invention. 

There is insufficient written description in the specification as-filed of ^a 
polypeptide comprising an Fc variant" as recited in the instant claims. 

The written description requirement for a claimed genus may be satisfied through 
sufficient description of a representative number of species by actual reduction to 
practice, reduction to drawings, or by disclosure of relevant, identifying characteristics, 
i.e., structure or other physical and/or chemical properties, by functional characteristics 
coupled with a known or disclosed correlation between function and structure, or by a 
combination of such identifying characteristics, sufficient to show the applicant was in 
possession of the claimed genus. (See Federal Register, Vol. 66, No. 4, pages 1099- 
1111, Friday January 5, 2001 , especially page 1 1 06 3 rd column). A "representative 
number of species" means that the species which are adequately described are 
representative of the entire genus. Thus, when there is substantial variation within the 
genus, one must describe a sufficient variety of species to reflect the variation within the 
genus. MPEP 2163 II.A.3a.ii. 
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The claims recite a genus "a polypeptide comprising an Fc variant" as part of the 
invention without providing a physical structure or testable functional activity for the ^ 
polypeptide comprising an Fc variant". 

The genus of the "a polypeptide comprising an Fc variant" are therefore very 
large. Applicant has disclosed only antibodies with certain amino acid modifications at 
the Fc region (e.g. see Example 2 on pages 128-130). Thus Applicant has disclosed 
only a limited species of the "a polypeptide comprising an Fc variant" , namely 
antibodies. The claimed "a polypeptide comprising an Fc variant' lack a common 
structure essential for their function and the claims do not require any particular 
structure basis or testable functions be shared by the instant "a polypeptide comprising 
an Fc variant" . 

Mere idea or function is insufficient for written description; isolation and 
characterization at a minimum are required. 

It is well known in the art that effector functions such as ADCC is a process 
where the Fey receptors of the natural killer cells and other leukocytes bind to antibody- 
coated target cells and destroy them (Burton et al. Human antibody effector function. 
Advances in Immunology, 1992, 51:1-84. See entire document, particularly Figure 18). 
The antibodies mediating ADCC must have Fc region for binding of Fc receptors as well 
as regions for binding of target cells. 

It does not appear based upon the limited disclosure of antibodies alone that 
Applicant was in possession of the necessary common attributes or features of the 
elements possessed by the members of the genus in view of the limited number of 
species disclosed and the extensive variation permitted within the genus of % 
polypeptide comprisin g an Fc variant". 
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"Adequate written description requires a precise definition, such as by structure, 
formula, chemical name or physical properties, not a mere wish or plan for obtaining the 
claimed chemical invention." Regents of the University o f California v. Eli Lilly and Co. 
43 USPQ2d, 1398, (Fed. Cir. 1997). 

The disclosure must allow one skilled in the art to visualize or recognize the 
identity of the subject matter of the claim. Id, 43 USPQ2d at 1406. 

in thP ahfignpft of disclosure of relevant, identify ing characteristics of the !a 
polypeptide comprising an Fc variant" , there is insufficient written disclosure under 35 
U.S.C. 112, first paragraph. 

Applicant is directed to the Guidelines for the Examination of Patent Applications 
Under the 35 U.S.C. 112, H 1 "Written Description" Requirement, Federal Register, Vol. 
66, No. 4, pages 1099-1 111, Friday January 5, 2001 . 

Applicant is reminded that Vas-Cath makes clear that the written description 
provision of 35 USC 1 12 is severable from its enablement provision (see page 1115). 

Applicant is invited to consider amending the claimed Fc variant/polypeptide to 
antibody and/or immunoadhesin as disclosed on pages 22-26 of the instant 
specification to obviate this rejection. 

1 1 . The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 
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(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

12. Claims 1-3, 6, 7, 10-15, 18-27, 34-40, 43, 59, and 63 are rejected under 35 
U.S.C. 102(b) as being anticipated by Presta (WO 00/42072. Reference B1 on IDS) 
(See entire document). 

Presta teaches polypeptide comprising a variant Fc region such as antibody and 
immunoadhesin. Specifically, Presta teach that the Fc regions of an antibody and 
immunoadhesin can be modified by amino acid substitutions at positions such as 239 
for altered binding affinity to FcyRs (see entire document, particularly Summary of the 
Invention on pages 5-8). Further, Presta teaches that the Fc region can be human lgG1 
and the amino acid substitution can be replacement of any naturally occurring amino 
acid residues e.g. Asp (D) (e.g. see pages 14-15). Furthermore, Presta teaches that 
the antibody can be produced using CD20 as antigen and can be further formulated in a 
pharmaceutical composition (see pages 35-54, in particular). 

Given the reference antibody variant comprises amino acid substitution at the 
same position (position 239) as the claimed polypeptide variant comprising an Fc 
variant, the claimed functional limitations associated with the polypeptide variant would 
be inherent properties of the reference antibody. 

Therefore, the reference teachings anticipate the claimed invention. 

13. Claims 1-3, 6, 7, 10-15, 18-27, 34-40, 43, 59, and 63 are rejected under 35 
U.S.C. 102(e) as being anticipated by Presta (US Patent 6,737,056. Reference A97 on 
IDS) (see entire document). 
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Presta teaches and claims polypeptide variant e.g. lgG1 with altered effector 
function comprising amino acid substitutions at positions including 239 (see entire 
document, particularly Detailed Description of the Preferred Embodiments on columns 
9-47 and claims 1-14). Further, Presta teaches that the Fc region can be human lgG1 
and the amino acid substitution can be replacement of any naturally occurring amino 
acid residues e.g. Asp (D) (e.g. see column 12). Furthermore, Presta teaches that the 
antibody can be produced using CD20 as antigen and can be further formulated in a 
pharmaceutical composition (e.g. see columns 29-44). 

Given the reference antibody variant comprises amino acid substitution at the 
same position (position 239) as the claimed polypeptide variant comprising an Fc 
variant, the claimed functional limitations associated with the polypeptide variant would 
be inherent properties of the reference antibody. 

Therefore, the reference teachings anticipate the claimed invention. 

14. The nonstatutory double patenting rejection is based on a judicially created 
doctrine grounded in public policy (a policy reflected in the statute) so as to prevent the 
unjustified or improper timewise extension of the "right to exclude" granted by a patent 
and to prevent possible harassment by multiple assignees. A nonstatutory 
obviousness-type double patenting rejection is appropriate where the conflicting claims 
are not identical, but at least one examined application claim is not patentably distinct 
from the reference claim(s) because the examined application claim is either anticipated 
by, or would have been obvious over, the reference claim(s). See, e.g., In re Berg, 140 
F.3d 1428, 46 USPQ2d 1226 (Fed. Cir. 1998); In re Goodman . 1 1 F.3d 1046, 29 
USPQ2d 2010 (Fed. Cir. 1993); In re LonoL 759 F.2d 887, 225 USPQ 645 (Fed. Cir. 
1985); In re Van Ornum . 686 F.2d 937, 214 USPQ 761 (CCPA 1982); In re VoaeL 422 
F.2d 438, 164 USPQ 619 (CCPA 1970); and In re Thorinaton , 418 F.2d 528, 163 
USPQ 644 (CCPA 1969). 
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A timely filed terminal disclaimer in compliance with 37 CFR 1.321(c) or 1.321(d) 
may be used to overcome an actual or provisional rejection based on a nonstatutory 
double patenting ground provided the conflicting application or patent either is shown to 
be commonly owned with this application, or claims an invention made as a result of 
activities undertaken within the scope of a joint research agreement. 

Effective January 1 , 1994, a registered attorney or agent of record may sign a 
terminal disclaimer. A terminal disclaimer signed by the assignee must fully comply with 
37 CFR 3.73(b). 

15. Claims 1 -3, 6, 7,10-15,1 8-27, 34-40, 43, 59, and 63 are provisionally rejected 
on the ground of nonstatutory obviousness-type double patenting as being unpatentable 
over: 

Claims 38-43 45 and 46 of copending USSN 1 1/483,378, 
Claims 38, 40, and 49-57 of copending USSN 1 1/483,250, 
Claims 1-42 of copending USSN 10/822,231, 
Claims 1-12 of copending USSN 11/124,620, and 
Claims 1-19 of the copending USSN 1 1/396,495. 

Although the conflicting claims are not identical, they are not patentably distinct 
from each other because both the instant and the copending application claims are 
drawn to same or nearly the same polypeptide variants with the same modifications to 
the Fc region at position 239 for altered affinity for FcyRs and effector functions. Given 
polypeptide variants rely on the same amino acid modification, the instant claims and 
the conflicting claims would anticipate or render obvious of one another. It is further 
noted that the instant claims 1 , 2, 10-13, 18-25, 34-38, and 59 recite a genus of 
polypeptide variants comprising Fc variants with at least one amino acid modification 
without reciting specific amino acid residues. In turn, the species recited in copending 
claims would thus anticipate the genus of the copending claims. 
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This is a provisional obviousness-type double patenting rejection because the 
conflicting claims have not in fact been patented. 

16. Claims 1-3, 6, 7, 10-15, 18-27, 34-40, 43, 59, and 63 are directed to an invention 
not patentably distinct from claims 38-43 45 and 46 of commonly assigned USSN 
11/483,378; and claims 1,2, 10-13, 18-25, 34-38, and 59 are directed to an invention 
not patentably distinct from claims 1-12 of commonly assigned USSN 1 1/124,620, and 
claims 1 -1 9 of the commonly assigned USSN 1 1/396,495 for reasons stated above. 

The U.S. Patent and Trademark Office normally will not institute an interference 
between applications or a patent and an application of common ownership (see MPEP 
Chapter 2300). Commonly assigned USSNs 1 1/483,378, 1 1/396,495, and 1 1/124,620, 
discussed above, would form the basis for a rejection of the noted claims under 35 
U.S.C. 103(a) if the commonly assigned case qualifies as prior art under 35 U.S.C. 
102(e), (f) or (g) and the conflicting inventions were not commonly owned at the time the 
invention in this application was made. In order for the examiner to resolve this issue, 
the assignee can, under 35 U.S.C. 103(c) and 37 CFR 1 .78(c), either show that the 
conflicting inventions were commonly owned at the time the invention in this application 
was made, or name the prior inventor of the conflicting subject matter. 

A showing that the inventions were commonly owned at the time the invention in 
this application was made will preclude a rejection under 35 U.S.C. 103(a) based upon 
the commonly assigned case as a reference under 35 U.S.C. 102(f) or (g), or 35 U.S.C. 
102(e) for applications pending on or after December 10, 2004. 



17. No claim is allowed. 
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1 8. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Chun Crowder whose telephone number is (571) 272- 
8142. The examiner can normally be reached Monday through Friday from 8:30 am to 
5:00 pm. A message may be left on the examiner's voice mail service. If attempts to 
reach the examiner by telephone are unsuccessful, the examiner's supervisor, Christina 
Chan can be reached on (571) 272-0841. The fax number for the organization where 
this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). .^T^^Kk^ 



PHILLIP GAMBEL, PH.D TO 
PRIMARY EXAMINER 




Chun Crowder, Ph.D. 




Patent Examiner 



July 18, 2006 
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ANTIBODY OPTIMIZATION 



This application claims the benefit of the filing date of Serial No. 60/360,843. filed March 1, 2002 and 
Serial No. 60/384,197, filed May 29, 2002, both of which are expressly incorporated by reference in 
10 their entirety. 



FIELD OF THE INVENTION 



The present invention relates to the use of computational screening methods to optimize the physico- 
15 chemical properties of antibodies, including stability, solubility, and antigen binding affinity. 

BACKGROUND OF THE INVENTION 

Monoclonal antibodies are in widespread use as therapeutics, diagnostics, and research reagents. 
20 As therapeutics, antibodies are used to treat a variety of conditions including cancer, autoimmune 
diseases, and cardiovascular disease. There are currently over ten approved antibody products on 
the US market, with over a hundred in development. Despite such acceptance and promise, there 
remains significant need for optimization of the structural and functional properties of antibodies. 

25 The physical and chemical properties of antibody therapeutics significantly determine their 

performance during development, manufacturing, and clinical use. Antibodies may suffer from the 
stability and solubility issues similar to all proteins. Since fully developed antibody therapeutics 
require high levels of stability and solubility in order to retain activity through purification, formulation, 
• storage, and administration, there is a need for effective methods to optimize antibody properties. 

30 Antibodies may be exposed to a variety of stresses, for example changes in temperature or pH, that 
may cause protein unfolding, destroy activity, or make the protein sensitive to proteolytic degradation. 
Proteins may be reengineered such that structure and activity are substantially more robust with 
respect to such stresses, for example, by optimizing intramolecular and interdomain interactions and 
by altering protease recognition sites. 

35 

Solubility is also of critical importance to antibody efficacy. Antibodies are typically formulated and 
administered at high concentration, conditions under which antibodies may form aggregates. 
Aggregates typically have poor activity and bioavailability, and are associated with increased 
immunogenicity. Solubility may also dictate which routes of administration are feasible. In many 
40 cases, antibody therapeutics have been limited to intravenous administration, because the antibody is 
not sufficiently soluble to allow formulation of an effective dose in the small volumes that are used for 
alternate routes of administration. In most cases, solubility obstacles have been considered as 
formulation problems that may be surmounted with exhaustive protein chemistry effort. However, 
such methods are inefficient, inconsistent, and time-consuming, often failing to yield soluble protein 
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5 even following a significant expenditure of resources. Engineering approaches are beginning to 
emerge for the generation of soluble proteins; for example, in some cases solubility may be improved 
by replacing solvent exposed nonpolar residues with structurally compatible polar residues. 

Another property of antibodies that frequently demands optimization is antigen-binding affinity. The 
10 binding affinity of an antibody for its biological target is a critical parameter for therapeutic efficacy. 
One particular case in which higher affinity is often sought is following humanization, herein defined 
as the reengineering of nonhuman antibodies to be more human-like in sequence. Humanization is 
carried out to reduce the immunogenicity of antibody therapeutics, but often results in loss of binding 
affinity for antigen. Regaining this affinity is typically desired during drug development. The main 
15 approach for enhancement of antigen affinity, herein referred to as affinity maturation, involves the 
engineering of mutations at positions that either directly contact antigen or indirectly influence binding. 
The demand for increased affinity for antigen is not, however, limited to humanization. Affinity 
maturation is frequently desired for therapeutic antibodies in general, whether they are derived from 
human, humanized, chimeric, or nonhuman sources. 

20 

Strategies for antibody optimization are sometimes carried out using random mutagenesis. In these 
cases positions are chosen randomly, or amino acid changes are made using simplistic rules. For 
example all residues may be mutated to alanine, referred to as alanine scanning. This can be used, 
for example, to map the antigen binding residues of an antibody (Kelley etal, 1993, Biochemistry 

25 32:6828-6835; Vajdos et a/., 2002, J, Mol Biol 320:41 5-428). The high level of sequence and 

structural similarity and large amount of sequence and structural information enable sequence-based 
methods of optimization. For example, sequence analysis has allowed significant characterization of 
the determinants of antibody stability and solubility (Ewert et al, 2003, J. Mol. Biol 325:531-553; 
Ewert etal t 2003, Biochemistry 42:1517-1528), and can enable sequence-based methods of affinity 

30 maturation (see, US 2003/0022240A1 and US 2002/01 771 70A1, both hereby incorporated by 

reference). Sequence and structural Information can be coupled with site-directed mutagenesis to 
engineer antibodies with enhanced biophysical properties (Worn & PlOckthun, 2001 , J. Mol Biol 
305:989-1 01 0; Wirtz & Steipe, 1 999, Protein Sci. 8:2245-2250). More sophisticated engineering 
approaches for implementing antibody optimization strategies employ selection methods to screen 

35 higher levels of sequence diversity. As is well known in the art, there are a variety of selection 

technologies which may be used for such approaches, including, for example, display technologies 
such as phage display, ribosome display, yeast display, and the like. Selection methods coupled with 
random or rational mutagenesis have found utility for optimizing antibody stability (Jung et a/., 1999, J. 
Mol Biol 294:163-180) and particularly for affinity maturation (Wu et a/., 1999, J. Mol Biol 294:151- 

40 162; Schier et a/., 1996, J. Mol Biol 255:28-43). 

Despite some success, these current engineering strategies for antibody optimization suffer from 
three main obstacles. First, the level of sequence diversity that is wanted or needed can dramatically 
exceed that which is accessible by these technologies. The number of possible protein sequences 

2 
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5 grows exponentially with the number of positions that are randomized. Practical considerations 
including experimental and physical constraints such as transformation efficiency, instrumentation 
limits, and the like can significantly limit library size. Even for methods capable of screening large 
combinatorial libraries, this presents an obstacle. For example, the upper limit of diversity accessible 
by phage display is approximately 10°, which limits mutations to 7 positions if a fully random (all 20 
10 amino adds) library is used. 

A second limitation of current antibody engineering efforts is that experimental screens used to 
assess the fitness of antibody variants are not efficient, and therefore engineering optimized 
antibodies can be time- and resource- intensive, with no guarantee of success. Nor do current 
15 experimental screens always have the capacity to be implemented as a selection. For example, 
antibody stability is not a property that is readily selected for using a display technology. Screening 
for more stable antibodies would require purifying individual variants and determining their 
thermodynamic stability using time consuming biophysical methods. 

20 A final limitation of current antibody engineering efforts is that constraints on proteins are not distinct. 
Instead, the determinants of antibody stability, solubility, and affinity for antigen are overlapping and 
the interactions that contribute to these properties are related. Thus, affinity maturation of an antibody 
may result in decreased stability, and optimization of an antibody's solubility may cause a loss in 
affinity for its antigen. This issue has Important ramifications for antibody engineering because 

25 current experimental antibody optimization methods are poorly suited for simultaneous optimization of 
multiple, related properties. Consequently, a large portion of the candidates in experimental libraries 
are unsuitable. For example, a large fraction of sequence space encodes unfolded, misfolded, 
incompletely folded, partially folded, or aggregated proteins. Even among sequences that are folded 
and active, many will be less active, less soluble, or less stable than the wild type protein. In effect, 

30 current antibody engineering efforts generate experimental libraries that are composed of a large 

amount of "wasted" sequence space. More significantly, the probability of finding a suitable sequence 
decreases dramatically as the number of properties that are considered increases. Thus, there is a 
need for computational screening methods to optimize the physico-chemical properties of antibodies, 
including stability, solubility, and antigen binding affinity. 

35 

SUMMARY OF THE INVENTION 

The present invention provides methods of computational screening that may be applied to enhance 
the stability of antibodies, the solubility of antibodies, and the affinity of antibodies for antigen. 

40 

More specifically, the present invention discloses a method for optimizing at least one physico- 
chemical property of an antibody, wherein the method is executed by a computer under the control of 
a program, and the computer including a memory for storing said program, said method comprising 
the steps of: a. receiving a template antibody structure; b. selecting at least one variable position 



3 



WO 03/074679 



PCT/US03/06598 



5 which belongs to said template antibody structure; c. selecting at least one amino acid to be 

considered at said variable positions; d. analyzing the interaction of each of said amino acids at each 
variable position with at least part of the remainder of said antibody, including said amino acids at 
other variable positions; and e. identifying a set of at least one antibody sequence with at least one 
optimized physico-chemical property. 

10 

The method of the present invention also optionally includes generating a library from the set of at 
least one antibody sequence and experimentally screening the library. 

Computational screening methods have demonstrated their utility and success for the optimization of 
1 5 a broad array of protein properties. Application of these methods to antibodies represents a 

significant improvement because there are well known and established engineering strategies that are 
uniquely suited to antibodies. Computational screening is a hypothesis-driven method for engineering 
proteins, and thus the validity of the employed design strategies are critical to success. The 
application of these established engineering strategies as computational screening design strategies 
20 is not necessarily straightforward. However, as will be provided in detail, a number of aspects and 
parameters of the computational screening method may be adjusted to enable implementation of 
established antibody engineering strategies. Because all antibodies share a common structural 
template and high sequence similarity, and because of the enormous amount of sequence and 
structural information available, successful design strategies for the use of computational screening to 
25 optimize antibody stability, solubility, and affinity for antigen are broadly applicable to the entire family 
of antibodies. Finally, antibodies are often comprised of multiple similar domains. As a result, 
computational screening methods are uniquely modular for antibodies, that is to say that optimizations 
can be applied in an additive manner to engineer antibodies with a breadth of simultaneously 
enhanced functional and biophysical properties in multiple structural regions. 

30 

Computational screening methods of the present invention overcome the limitations of current 
antibody engineering methods. These methods are capitalizing on enormous recent advances in 
understanding of protein structure and function, substantial increases in the availability of high- 
resolution structures, and dramatic improvements in computing power. These methods offer a 

35 mechanism to explore sequence combinations that extend far beyond natural diversity, up to 10 50 or 
more sequences. Computational screening also enables the exploration of combinatorial complexity 
in the absence of experimentally selectable function, and thus biophysical properties such as stability 
and solubility, which are difficult to screen or select for, may be rationally screened in silico. Finally, 
computational screening methods offer the ability to aigorithmically couple multiple constraints for 

40 simultaneous optimization of several protein properties. Thus experimental libraries that are designed 
using computational screening are composed primarily of productive sequence space. Computational 
screening may enrich experimental libraries with quality diversity, whether such experimental libraries 
are small such that members may be screened individually, or they are large such that selection 
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5 methods are required for screening. As a result, computational screening Increases the chances of 
identifying antibodies that are broadly optimized for stability, solubility, and affinity for antigen. 

An additional benefit of computational screening methodology is that it is hypothesis driven (dash 
here). Thus successful strategies may be reapplied to antibodies as a whole, saving discovery cost 
10 and time. This is particularly relevant for antibodies because all antibodies share a common structural 
template and high sequence similarity, and because of the enormous amount of sequence and 
structural information available. 

It is an object of the present invention to provide design strategies for the application of computational 
1 5 screening methods to enhance the stability of antibodies, to enhance the solubility of antibodies, and 
to affinity mahire antibodies. Said design strategies describe the theoretical and/or experimental 
basis for their use, how the choice of variable positions and amino acids considered at those positions 
are carried out for their implementation, and ways in which experimental and sequence Information 
may be used. 

20 

It is a further object of the present invention to provide computational methods for the application of 
computational screening methods to enhance the stability of antibodies, to enhance the solubility of 
antibodies, and to affinity mature antibodies. These computational methods describe a broad array of 
scoring functions, optimization algorithms, and the like for implementing computer programs to 
25 optimize antibodies. The computational methods further describe ways by which computational 
output may be used to generate experimental libraries of variants for experimental validation. 

It is another object of the present invention to provide experimental methods for the application of 
computational screening technology to enhance the stability of antibodies, to enhance the solubility of 
30 antibodies, and to affinity mature antibodies. The experimental methods describe a broad array of 
molecular biology, protein production, and screening techniques that may be used to experimentally 
validate antibody variants that have been optimized for improved properties using computational 
screening methods. 

35 in accordance with the objects outlined above, the present invention provides computational 
screening methods to optimize antibodies. 

BRIEF DESCRIPTION OF THE DRAWINGS 

40 Figure 1 . Antibody structure and function. Shown is a model of a full-length human lgG1 antibody, 
constructed by combining the structure of the Campath Fab fragment (pdb accession code 1CE1), 
with the structure of the human lgG1 Fc region (pdb accession code 1DN2). The antibody is a 
homodimer of heterodimers, made up of two light chains and two heavy chains. The Ig domains that 
comprise the antibody are labeled, and include V L and C u for the light chain, and V Hl Cgammal (Cy1), 
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5 Cgamma2 (Cy2), and Cgamma3 (Cy3) for the heavy chain. Antibody regions relevant to the 

discussion are also labeled, including the variable region (Fv), the Fab region, and the Fc region. The 
regions which bind molecules or proteins relevant to the present invention are indicated, including the 
antigen binding site in the variable region, and the Fc region which binds FcyRs, FcRn, C1q, and 
proteins A and G. Campath is a registered trademark in the US of Burroughs Wellcome. 

10 

Figures 2a and 2b. Human germ line sequences and aligned antibody sequences. The sequences 
which are known to encode the human heavy chain variable region (V H ) and the human kappa light 
chain variable region (V u ) are shown aligned with four relevant antibody sequences. The germ line 
sequences were obtained from the IMGT database (IMGT, the international ImMunoGeneTics 

15 information system®; imgt.cines.fr), and aligned and numbered according to the numbering scheme 
of Chothia (Chothia et a/., 1992, J Mol. Biol. 227 \ 776-798, 799-817; Tomiinson et a/., 1 995, EMBO J. 
14:4628-4638; Williams e/a/., 1996, J. Mol. Biol. 264:220-232; Al-Lazikani et a/., 1997, J. Mol. Biol. 
273, 927-948; Chothia et a/., 1998, J. Mol. Biol. 278, 457-479; all of which are herein expressly 
incorporated by reference). The regions of the variable region are indicated above the numbering, 

20 and these include framework regions 1 through 3 (FR1, FR2, and FR3) and the complementarity 

determining regions (CDRs) 1 through 3 (CDR1, CDR2, and CDR3). As is known in the art, V H CDR3 
is not a part of the V H germ line and V L CDR3 is encoded only up to Chothia position 95 in the V L 
kappa germ line. Positions that make up CDRs are underlined. The germ line chains are grouped 
into 7 subfamilies for both V H and V L , as is known in the art, and these subfamilies are grouped 

25 together and separated by a blank line. Four antibody sequences used in the examples of the 

present invention, listed by their pdb accession codes and underlined, are shown below the subfamily 
to which they are closest in sequence. These sequences were aligned using the alignment program 
BLAST. The most similar germ line sequences to these four antibodies, as determined by this 
alignment analysis, are shown in parentheses next to the antibody code. The most similar germ line 

30 V H chains to the four antibodies are VH_3-74 for D3H44 (1JPT), VH_3-66 for Herceptin (1 FVC), 
VH_4-59 and VH>72 for Campath (1CE1), and VH7-4-1 for rhumAb VEGF (1CZ8). The most 
similar germ line V L chains to the four antibodies are VLkjlD-3 for D3H44 (1JPT), VLkJD-3 for 
Herceptin (1FVC), VLkJD-33 for Campath (1CE1), and VLkJD-33 for rhumAb VEGF (1CZ8). 
Herceptin is a registered trademark in the US owned by Genentech, Inc. 

35 

Figure 3. Antibody structures relevant to the presented examples. The seven antibody structures 
used in the present invention are listed. For each antibody is listed the target antigen, the source, the 
pdb accession code, whether the structure is a complex of the antibody with antigen (bound) or is 
uncomplexed (unbound), the resolution, and the reference. 

40 

Figure 4. Campath V H domain stabilization. The large central figure shows the Campath V H domain 
from 1CE1 as a gray ribbon diagram, with Example 1 variable position residues represented as black 
lines. The smaller figure in the upper left shows the modeled full-length antibody structure (from 
Figure 1) with the relevant domain highlighted by a box. 
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5 

Figures 5a, 5b, and 5c. Campath V H domain stabilization. Figure 5a shows the results of the 
computational screening calculations described in Example 1 . Column 1 lists the heavy (H) chain 
variable positions. Column 2 lists the amino acids considered at each variable position. The set of 
amino acids belonging to the Core classification are described in the section entitled "Selection of 

10 Amino Acids to be Considered at Each Position". Column 3 lists the WT Campath amino acid identity 
at each variable position. Column 4 lists the amino acid identity at each variable position in the DEE 
ground state sequence predicted by the computational screening calculations. Column 5 lists the set 
of amino acids at each variable position that are observed in the Monte Carlo output. Each amino 
acid is followed by its occupancy, that is the number of sequences In the 1000 sequence set that 

15 contain that amino acid at that variable position. Figures 5b and 5c show experimental libraries 

derived from the computational screening results, as described in Example 1. Column 1 lists variable 
positions and column 2 shows amino acid substitutions that are included in the experimental library. 
Figure 5c is represented combinatorially, that is the explicit library is the combination of each possible 
amino acid substitution at each variable position with ail other possible amino acid substitutions at all 

20 other positions. The complexity of the library, that is the total number of defined sequences of which it 
is composed, is shown in the bottom row. 

Figure 6. Campath V L domain stabilization. The large central figure shows the Campath V L domain 
from 1CE1 as a gray ribbon diagram, with Example 2 variable position residues represented as black 
25 lines. The smaller figure in the upper left shows the modeled full-length antibody structure with the 
relevant domain highlighted by a box. 

Figures 7a and 7b. Campath V L domain stabilization. Figure 7a shows the results of the 
computational screening calculations described in Example 2. Column 1 lists the light (L) chain 

30 variable positions. Column 2 lists the amino acids considered at each variable position. The set of 
amino acids belonging to the Core and Boundary classifications are described in the section entitled 
"Selection of Amino Acids to be Considered at Each Position". Column 3 lists the WT Campath amino 
acid identity at each variable position. Column 4 lists the amino acid identity at each variable position 
in the DEE ground state sequence predicted by the computational screening calculations. Column 5 

35 lists the set of amino acids at each variable position which are observed in the Monte Carlo output. 
Each amino acid is followed by its occupancy, that is the number of sequences in the 1000 sequence 
set that contain that amino acid at that variable position. Figure 7b shows an experimental library 
derived from the computational screening results, as described in Example 2. Column 1 lists variable 
positions and column 2 shows amino acid substitutions which are included In the experimental library. 

40 The library is represented combinatorially, that is the explicit library is the combination of each 
possible amino acid substitution at each variable position with ail other possible amino acid 
substitutions at all other positions. The complexity of the library, that is the total number of defined 
sequences of which it is composed, is shown in the bottom row. 
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Figure 8. Campath V H Cy1 domain stabilization. The large central figure shows the Campath V H Cy1 
domain from 1CE1 as a gray ribbon diagram, with Example 3 variable position residues represented 
as black lines. The smaller figure in the upper left shows the modeled full-length antibody structure 
with the relevant domain highlighted by a box. 

Figures 9a and 9b. Campath V H Cy1 domain stabilization. Figure 9a shows the results of the 
computational screening calculations described in Example 3. Column 1 lists the heavy (H) chain 
variable positions. Column 2 lists the amino adds considered at each variable position. The set of 
amino acids belonging to the Core and Boundary classifications are described in the section entitled 
"Selection of Amino Acids to be Considered at Each Position". Column 3 lists the WT Campath amino 
acid identity at each variable position. Column 4 lists the amino acid identity at each variable position 
in the DEE ground state sequence predicted by the computational screening calculations. Column 5 
lists the set of amino acids at each variable position that are observed in the Monte Carlo output. 
Each amino acid is followed by its occupancy, that is the number of sequences In the 1000 sequence 
set that contain that amino acid at that variable position. Figure 9b shows an experimental library 
derived from the computational screening results, as described in Example 3. Column 1 lists variable 
positions, and column 2 shows amino acid substitutions that are included in the experimental library. 
The library is represented combinatorially, that is the explicit library is the combination of each 
possible amino acid substitution at each variable position with ail other possible amino acid 
substitutions at ail other positions. The complexity of the library, that is the total number of defined 
sequences of which it is composed, is shown in the bottom row. 

Figure 10. Fc V H Cy2 domain stabilization. The large central figure shows the Fc V H Cy2 domain from 
1DN2 as a gray ribbon diagram, with Example 4 variable position residues represented as black lines. 
The smaller figure in the upper left shows the modeled full-length antibody structure with the relevant 
domain highlighted by a box. 

Figures 1 1 a and 1 1 b. Fc V H Cy2 domain stabilization. Figure 1 1 a shows the results of the 
computational screening calculations described in Example 4. Column 1 lists the heavy (H) chain 
variable positions. Column 2 lists the amino acids considered at each variable position. The set of 
amino acids belonging to the Core and Boundary classifications are described in the section enUWed 
"Selection of Amino Acids to be Considered at Each Position". Column 3 lists the WT Campath amino 
acid identity at each variable position. Column 4 lists the amino acid identity at each variable position 
in the DEE ground state sequence predicted by the computational screening calculations. Column 5 
lists the set of amino acids at each variable position that are observed in the Monte Carlo output. 
Each amino acid Is followed by its occupancy, that is the number of sequences In the 1000 sequence 
set that contain that amino acid at that variable position. Figure 1 1 b shows an experimental library 
derived from the computational screening results, as described in Example 4. Column 1 lists variable 
positions, and column 2 shows amino acid substitutions that are included in the experimental library. 
The library is represented combinatorially, that is the explicit library is the combination of each 
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possible amino acid substitution at each variable position with all other possible amino acid 
substitutions at afl other positions. The complexity of the library, that is the total number of defined 
sequences of which it is composed, is shown in the bottom row. 



Figure 12. Fc V H Cy3 domain stabilization. The large central figure shows the Fc V H Cy3 domain from 
10 1 DN2 as a gray ribbon diagram, with Example 5 variable position residues represented as black lines. 
The smaller figure in the upper left shows the modeled full-length antibody structure with the relevant 
domain highlighted by a box. 

Figures 13a and 13b. Fc V H Cy3 domain stabilization. Figure 13a shows the results of the 
15 computational screening calculations described in Example 5. Column 1 lists the heavy chain 

variable positions. Column 2 lists the amino acids considered at each variable position. The set of 
amino acids belonging to the Core and Boundary classifications are described in the section entitled 
"Selection of Amino Acids to be Considered at Each Position*. Column 3 lists the WT Fc amino acid 
identity at each variable position. Column 4 lists the amino acid identity at each variable position in 
20 the DEE ground state sequence predicted by the computational screening calculations. Column 5 
lists the set of amino acids at each variable position that are observed in the Monte Carlo output. 
Each amino acid is followed by its occupancy, that is the number of sequences in the 1000 sequence 
set that contain that amino acid at that variable position. Figure 13b shows an experimental library 
derived from the computational screening results, as described in Example 5. Column 1 lists variable 
25 positions, and column 2 shows amino acid substitutions that are included in the experimental library. 
The library is represented combinatorially, that is the explicit library is the combination of each 
possible amino acid substitution at each variable position with all other possible amino acid 
substitutions at all other positions. The complexity of the library, that is the total number of defined 
sequences of which it is composed, is shown in the bottom row. 

30 

Figure 14. rhumAb VEGF Vh/V l interface stabilization. The large central figure shows the rhumAb 
VEGF V H and V L domains from 1CZ8 as black and gray ribbons respectively, with Example 6 variable 
position residues represented as black lines. The smaller figure in the upper left shows the modeled 
full-length antibody structure with the relevant region highlighted by a box. 

35 

Figures 15a, 15b, and 15c. rhumAb VEGF Vh/Vl interface stabilization. Figures 15a and 15b show 
the results of the computational screening calculations described in Example 6. Column 1 lists the 
light (L) and heavy (H) chain variable positions. Column 2 lists the amino acids considered at each 
variable position. The set of amino acids belonging to the Core and Boundary classifications are 
40 described in the section entitled "Selection of Amino Acids to be Considered at Each Position". 

Column 3 lists the WT rhumAb VEGF amino acid identity at each variable position. Column 4 lists the 
amino acid identity at each variable position in the DEE ground state sequence predicted by the 
computational screening calculations. Column 5 lists the set of amino acids at each variable position 
that are observed in the Monte Carlo output. Each amino acid is followed by its occupancy, that is the 
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number of sequences in the 1000 sequence set that contain that amino acid at t that variable position. 
Figure 15c shows an experiments library derived from the computational screening results, as 
described in Example 6. Column 1 lists variable positions, and column 2 shows amino acid 
substitutions that are included in the experimental library. The library is represented combinatorially, 
that is the explicit library Is the combination of each possible amino acid substitution at each variable 
position with all other possible amino acid substitutions at all other positions. The complexity of the 
library, that is the total number of defined sequences of which it is composed, is shown in the bottom 
row. 

Figures 16a and 16b. Sequence alignment of rhumAb VEGF variable region with the human variable 
region germ line. The rhumAb VEGF V H and V L sequences are shown aligned with the sequences 
that encode the human V H (Figure 16a) and V L (Figure 16b) germ line. The germ line sequences 
were obtained from the IMGT database, and numbered according to the numbering scheme of 
Chothia. The regions of the variable region are indicated above the numbering, and these include 
framework regions 1 through 3 (FR1, FR2, and FR3) and the complementarity determining regions 
(CDRs) 1 through 3 (CDR1, CDR2, and CDR3). Positions that make up CDRs are underlined. The 7 
germ line subfamilies for V H and V L are grouped together and separated by a blank line. The rhumAb 
VEGF V H and V L sequences were aligned to the germ line sequences using the alignment program 
BLAST. rhumAb VEGF V H is most similar to the germ line chain VH_7-4-1, and rhumAb VEGF V u is 
most similar to the germ line chain VLkJ D-33. The rhumAb VEGF V H and V L sequences are 
indicated by the underlined pdb accession code 1CZ8, and shown below the subfamily to which they 
are closest in sequence. Amino acids at variable positions for Example 6 design calculations are 
shown in bold in the 1CZ8 and the germ line sequences. 

Figures 17a and17b. rhumAb VEGF sequence-guided V H N L interface stabilization. Figure 17a 
shows the results of the computational screening calculations described in Example 6. Rows 1 
through 5 list the chain (L, light chain or H, heavy chain), variable positions as defined in the 1CZ8 
structure and the according to the Chothia numbering scheme, amino acids considered at those 
positions as obtained from Figures 16a and 16b, and the amino acid at each position in the WT 
rhumAb VEGF sequence. "All" or "All 20" means that all 20 amino acids are considered at the 
variable position. The rows that follow list the amino acid identity at variable positions for the lowest 
energy sequence from each cluster group, as described in Example 6. Figure 17a is similar to Figure 
17b except that all the listed sequences are the set of sequences make up cluster group 5. 

Figure 18. Herceptin V H A/ L interface stabilization. The large central figure shows the Herceptin V H 
and V L domains from 1FVC as black and gray ribbons respectively, with Example 7 variable position 
residues represented as black lines. The smaller figure in the upper left shows the modeled full- 
length antibody structure with the relevant region highlighted by a box. 
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5 Figures 19a, 19b, 19c, and 19d. Herceptin Vh/V l interface stabilization. Figures 19a and 19c show 
the results of the computational screening calculations described in Example 7. Column 1 lists the 
light (L) and heavy (H) chain variable positions. Column 2 lists the amino acids considered at each 
variable position. The set of amino acids belonging to the Core, Surface, and Boundary 
classifications are described in the section entitled "Selection of Amino Acids to be Considered at 

10 Each Position". Column 3 lists the WT Herceptin amino acid Identity at each variable position. 
Column 4 lists the amino acid identity at each variable position in the DEE ground state sequence 
predicted by the computational screening calculations. Column 5 lists the set of amino acids at each 
variable position that are observed in the Monte Carlo output. Each amino acid is followed by its 
occupancy, that is the number of sequences in the 1000 sequence set that contain that amino acid at 

15 that variable position. Figures 19b and 19d show experimental libraries derived from the 

computational screening results, as described in Example 7. Column 1 lists variable positions, and 
column 2 shows amino acid substitutions that are included in the experimental library. The libraries 
are represented combinatorially, that is the explicit library is the combination of each possible amino 
acid substitution at each variable position with all other possible amino acid substitutions at all other 

20 positions. The complexity of the libraries, that is the total number of defined sequences of which it is 
composed, is shown in the bottom row. 

Figure 20. rhumAb VEGF C L /Cy1 interface stabilization. The large central figure shows the VEGF C L 
and Cy1 domains from 1 CZ8 as black and gray ribbons respectively, with Example 8 variable position 
25 residues represented as black lines. The smaller figure in the upper left shows the modeled full- 
length antibody structure with the relevant region highlighted by a box. 

Figures 21a and 21b. rhumAb VEGF C L /Cy1 interface stabilization. Figure 21a shows the results of 
the computational screening calculations described in Example 8. Column 1 lists the light (L) and 

30 heavy (H) chain variable positions. Column 2 lists the amino acids considered at each variable 
position. The set of amino acids belonging to the Core classifications are described in the section 
entitled "Selection of Amino Acids to be Considered at Each Position". Column 3 lists the WT rhumAb 
VEGF amino acid identity at each variable position. Column 4 lists the amino acid identity at each 
variable position in the DEE ground state sequence predicted by the computational screening 

35 calculations. Column 5 lists the set of amino acids at each variable position that are observed in the 
Monte Carlo output. Each amino acid is followed by its occupancy, that is the number of sequences 
in the 1000 sequence set that contain that amino acid at that variable position. Figure 21b shows an 
experimental library derived from the computational screening results, as described In Example 8. 
Column 1 lists variable positions, and column 2 shows amino acid substitutions that are included in 

40 the experimental library. The libraries are represented combinatorially, that is the explicit library is the 
combination of each possible amino acid substitution at each variable position with all other possible 
amino acid substitutions at all other positions. The complexity of the libraries, that is the total number 
of defined sequences of which it is composed, is shown in the bottom row. 
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5 Figure 22. Fc Cy3/Cy3 interface stabilization. The large central figure shows the Fc Cy3 domains 
from 1DN2 as gray ribbons, with Example 9 variable position residues represented as black lines. 
The smaller figure in the upper left shows the modeled full-length antibody structure with the relevant 
region highlighted by a box. 

10 Figures 23a and 23b. Fc Cy3/Cy3 interface stabilization. Figure 23a shows the results of the 
computational screening calculations described in Example 9. Column 1 lists the heavy chain 
variable positions. Chains A and B are the two symmetrical C r 3 domains in the 1 DN2 structure. 
Column 2 lists the amino acids considered at each variable position. The set of amino acids 
belonging to the Core classifications are described in the section entitled "Selection of Amino Acids to 

1 5 be Considered at Each Position". Column 3 lists the WT Fc amino acid identity at each variable 
position. Column 4 lists the amino acid identity at each variable position in the DEE ground state 
sequence predicted by the computational screening calculations. Column 5 lists the set of amino 
acids at each variable position that are observed in the Monte Carlo output. Each amino acid is 
followed by its occupancy, that is the number of sequences In the 1000 sequence set that contain that 

20 amino acid at that variable position. Figure 23b shows an experimental library derived from the 
computational screening results, as described In Example 9. Column 1 lists variable positions, and 
column 2 shows amino acid substitutions that are included in the experimental library. The libraries 
are represented combinatorially, that is the explicit library is the combination of each possible amino 
acid substitution at each variable position with all other possible amino acid substitutions at all other 

25 positions. The complexity of the libraries, that is the total number of defined sequences of which It is 
composed, is shown in the bottom row. 

Figure 24. Campath solubility optimization. The large central figure shows the Campath Fab 
fragment from 1CE1 as a gray ribbon diagram, with Example 10 variable position residues 
30 represented as black ball and sticks. The smaller figure in the upper left shows the modeled full- 
length antibody structure with the relevant region highlighted by a box. 

Figures 25a and 25b. Campath solubility optimization. Figure 25a shows the results of the 
computational screening calculations described in Example 10. Column 1 lists the heavy (H) and light 

35 (L) chain variable positions. Column 2 lists the wild type amino acid identity at each variable position. 
The remaining 20 columns indicate which of the 20 natural amino acids are favorable substitutions for 
each variable position according to the computational screening calculations. The presence of an 
amino acid in its column for a variable position indicates that the amino acid is within 1 unit of energy 
of the lowest energy substitution. Figure 25b shows an experimental library derived from the 

40 computational screening results, as described in Example 10. Column 1 lists variable positions, and 
column 2 shows amino acid substitutions that are included in the experimental library. The library Is 
represented combinatorially, I.e. the explicit library is the combination of each possible amino acid 
substitution at each variable position with all other possible amino acid substitutions at ail other 
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5 positions. The complexity of the library, that is the total number of defined sequences of which it is 
composed, is shown in the bottom row. 

Figure 26. rhumAb VEGF solubility optimization. The large central figure shows the rhumAb VEGF 
Fab fragment from 1CZ8 as a gray ribbon diagram, with Example 1 1 variable position residues 
10 represented as black ball and sticks. The smaller figure in the upper left shows the modeled full- 
length antibody structure with the relevant region highlighted by a box. 

Figures 27a and 27b. rhumAb VEGF solubility optimization. Figure 27a shows the results of the 
computational screening calculations described in Example 11. Column 1 lists the heavy (H) and light 

1 5 (L) chain variable positrons. Column 2 lists the wild type amino acid identity at each variable position. 
The remaining 20 columns indicate which of the 20 natural amino acids are favorable substitutions for 
each variable position according to the computational screening calculations. The presence of an 
amino acid in its column for a variable position indicates that the amino acid is within 1 unit of energy 
of the lowest energy substitution. Figure 27b shows an experimental library derived from the 

20 computational screening results, as described in Example 1 1 . Column 1 lists variable positions, and 
column 2 shows amino acid substitutions that are included in the experimental library. The library is 
represented combinatorially, i.e. the explicit library is the combination of each possible amino acid 
substitution at each variable position with all other possible amino acid substitutions at all other 
positions. The complexity of the library, that is the total number of defined sequences of which it is 

25 composed, is shown in the bottom row. 

Figure 28. Herceptin solubility optimization. The large central figure shows the Herceptin scFv 
fragment from 1 FVC as a gray ribbon diagram, with Example 12 variable position residues 
represented as black ball and sticks. The smaller figure in the upper left shows the modeled full- 
30 length antibody structure with the relevant region highlighted by a box. 

Figures 29a and 29b. Herceptin solubility optimization. Figure 29a shows the results of the 
computational screening calculations described in Example 12. Column 1 lists the heavy (H) and light 
(L) chain variable positions. Column 2 lists the wild type amino acid identity at each variable position. 

35 The remaining 20 columns indicate which of the 20 natural amino acids are favorable substitutions for 
each variable position according to the computational screening calculations. The presence of an 
amino acid in its column for a variable position indicates that the amino acid is within 1 unit of energy 
of the lowest energy substitution. Figure 29b shows an experimental library derived from the 
computational screening results, as described in Example 12. Column 1 lists variable positions, and 

40 column 2 shows amino acid substitutions that are included in the experimental library. The library is 
represented combinatorially, i.e. the explicit library is the combination of each possible amino acid 
substitution at each variable position with all other possible amino acid substitutions at ail other 
positions. The complexity of the library, that is the total number of defined sequences of which it is 
composed, Is shown in the bottom row. 



13 



WO 03/074679 



PCT/US03/06598 



5 

Figure 30. Fc solubility optimization. The large central figure shows the Fc region from 1DN2 as a 
gray ribbon diagram, with Example 13 variable position residues represented as black ball and sticks. 
The smaller figure In the upper left shows the modeled full-length antibody structure with the relevant 
region highlighted by a box. 

10 

Figures 31a and 31 b. Fc solubility optimization. Figure 31 a shows the results of the computational 
screening calculations described in Example 13. Column 1 lists the heavy chain variable positions for 
the A chain, i.e. for only one of the Cy2-Cy3 heavy chains of the homodimer. Column 2 lists the wild 
type amino acid identity at each variable position. The remaining 20 columns indicate which of the 20 

t5 natural amino acids are favorable substitutions for each variable position according to the 

computational screening calculations. The presence of an amino acid In its column for a variable 
position indicates that the amino acid is within 1 unit of energy of the lowest energy substitution. 
Figure 31b shows an experimental library derived from the computational screening results, as 
described in Example 13. Column 1 lists variable positions, and column 2 shows amino acid 

20 substitutions that are included in the experimental library. The library is represented combinatorially, 
i.e. the explicit library is the combination of each possible amino acid substitution at each variable 
position with all other possible amino acid substitutions at all other positions. The complexity of the 
library, that is the total number of defined sequences of which it is composed, is shown in the bottom 
row. 

25 

Figure 32. rhumAb VEGF affinity maturation. The large central figure shows the 1CZ8 rhumAb VEGF 
V H and V L domains as gray ribbons bound to the VEGF target antigen as black ribbon, with Example 
14 variable position residues represented as black lines. The smaller figure in the upper left shows 
the modeled full-length antibody structure with the relevant region highlighted by a box. 

30 

Figures 33a and 33b. rhumAb VEGF affinity maturation. Figure 33a shows the results of the 
computational screening calculations described in Example 14. Column 1 lists the light (L) and heavy 
(H) chain variable positions. Column 2 lists the amino acids considered at each variable position. 
The set of amino acids belonging to the Core, Surface, and Boundary classifications are described in 

35 the section entitled "Selection of Amino Acids to be Considered at Each Position". Column 3 lists the 
WT rhumAb VEGF amino acid identity at each variable position. Column 4 lists the amino acid 
identity at each variable position in the DEE ground state sequence predicted by the computational 
screening calculations. Column 5 lists the set of amino acids at each variable position that are 
observed in the Monte Carlo output. Each amino acid is followed by its occupancy, that is the number 

40 of sequences In the 1000 sequence set that contain that amino acid at that variable position. Figure 
33b shows an experimental library derived from the computational screening results, as described in 
Example 14. Column 1 lists variable positions, and column 2 shows amino acid substitutions that are 
included in the experimental library. The libraries are represented combinatorially, that is the explicit 
library is the combination of each possible amino acid substitution at each variabie position with all 
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5 other possible amino acid substitutions at all other positions. The complexity of the libraries, that is 
the total number of defined sequences of which It is composed, is shown in the bottom row. 
Figure 34. rhumAb VEGF affinity maturation. The large central figure shows the 1CZ8 rhumAb VEGF 
V H and V L domains as gray ribbons bound to the VEGF target antigen shown as black ribbon, with 
Example 14 variable position residues represented as black lines. The smaller figure in the upper left 

1 0 shows the modeled full-length antibody structure with the relevant region highlighted by a box. 

Figures 35a and 35b. rhumAb VEGF affinity maturation. Figure 35a shows the results of the 
computational screening calculations described in Example 14. Column 1 lists the light (L) and heavy 
(H) chain variable positions. Column 2 lists the amino acids considered at each variable position. 

1 5 The set of amino acids belonging to the Core, Surface, and Boundary classifications are described in 
the section entitled "Selection of Amino Acids to be Considered at Each Position". Column 3 lists the 
WT rhumAb VEGF amino acid identity at each variable position. Column 4 lists the amino acid 
identity at each variable position in the DEE ground state sequence predicted by the computational 
screening calculations. Column 5 lists the set of amino acids at each variable position that are 

20 observed in the Monte Carlo output. Each amino acid is followed by its occupancy, that is the number 
of sequences in the 1000 sequence set that contain that amino add at that variable position. Figure 
35b shows an experimental library derived from the computational screening results, as described in 
Example 14. Column 1 lists variable positions, and column 2 shows amino acid substitutions that are 
included in the experimental library. The libraries are represented combinatoriaily, that is the explicit 

25 library is the combination of each possible amino acid substitution at each variable position with all 
other possible amino acid substitutions at all other positions. The complexity of the libraries, that is 
the total number of defined sequences of which it is composed, is shown in the bottom row. 

Figure 36. SM3 affinity maturation. The large central figureshows the 1SM3 V H and V L domains as 
30 gray ribbons bound to the MUC1 antigen shown as black ribbon, with Example 1 5 variable position 
residues represented as black lines. The smaller figure in the upper left shows the modeled full- 
length antibody structure with the relevant region highlighted by a box. 

Figures 37a, 37b, and 37c. SM3 affinity maturation. Figures 37a and 37b show the results of the 
35 computational screening calculations described in Example 1 5. Column 1 lists the light (L) and heavy 
(H) chain variable positions. Column 2 lists the amino acids considered at each variable position. 
The set of amino acids belonging to the Core, Surface, and Boundary classifications are described in 
the section entitled "Selection of Amino Acids to be Considered at Each Position". Column 3 lists the 
WT SM3 amino acid identity at each variable position. Column 4 lists the amino acid identity at each 
40 variable position in the DEE ground state sequence predicted by the computational screening 

calculations. Column 5 lists the set of amino acids at each variable position that are observed in the 
Monte Carlo output. Each amino acid is followed by its occupancy, that is the number of sequences 
in the 1000 sequence set that contain that amino acid at that variable position. Figure 37c shows an 
experimental library derived from the computationai screening results, as described in Example 15. 
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Column 1 lists variable positions, and column 2 shows amino acid substitutions that are included in 
the experimental library. The libraries are represented combinatorially, that is the explicit library is the 
combination of each possible amino acid substitution at each variable position with all other possible 
amino acid substitutions at all other positions. The complexity of the libraries, that is the total number 
of defined sequences of which it is composed, is shown in the bottom row. 

Figure 38. Campath affinity maturation. The large central figure shows the 1CE1 V H and V L domains 
as gray ribbons bound to the CD52 antigen shown as black ribbon, with Example 16 variable position 
residues represented as black lines. The smaller figure in the upper left shows the modeled full- 
length antibody structure with the relevant region highlighted by a box. 



Figures 39a and 37b. Campath affinity maturation. Figure 39a shows the results of the computational 
screening calculations described in Example 16. Column 1 lists the light (L) and heavy (H) chain 
variable positions. Column 2 lists the amino acids considered at each variable position. The set of 
amino acids belonging to the Core, Surface, and Boundary classifications are described in the section 

20 entitled "Selection of Amino Acids to be Considered at Each Position". Column 3 lists the WT 

Campath amino acid identity at each variable position. Column 4 lists the amino acid identity at each 
variable position in the DEE ground state sequence predicted by the computational screening 
calculations. Column 5 lists the set of amino acids at each variable position that are observed in the 
Monte Carlo output. Each amino acid is followed by its occupancy, that is the number of sequences 

25 in the 1 000 sequence set that contain that amino acid at that variable position. Figure 39b shows an 
experimental library derived from the computational screening results, as described in Example 16. 
Column 1 lists variable positions, and column 2 shows amino acid substitutions that are included in 
the experimental library. The libraries are represented combinatorially, that is the explicit library is the 
combination of each possible amino acid substitution at each variable position with all other possible 

30 amino acid substitutions at all other positions. The complexity of the libraries, that is the total number 
of defined sequences of which it is composed, is shown in the bottom row. 

Figures 40a and 40b. Sequence alignment of Campath variable region with the human variable 
region germ line. The Campath V H and V L sequences are shown aligned with the sequences that 

35 encode the human V H (Figure 40a) and V L (Figure 40b) germ line. The germ line sequences were 
obtained from the IMGT database, and numbered according to the numbering scheme of Chothia. 
The regions of the variable region are indicated above the numbering, and these include framework 
regions 1 through 3 (FR1, FR2, and FR3) and the complementarity determining regions (CDRs) 1 
through 3 (CDR1 , CDR2, and CDR3). Positions that make up CDRs are underlined. The 7 germ line 

40 subfamilies for V H and V L are grouped together and separated by a blank line. The Campath V H and 
V L sequences were aligned to the germ line sequences using the alignment program BLAST. 
Campath V H is most similar to the germ line chain VH_4-59 and VH_3-72, and Campath V L is most 
similar to the germ line chain VLkJlD-33. The Campath V H and V L sequences are indicated by the 
underlined pdb accession code 1CE1 , and shown below the subfamily to which they are closest in 



16 



WO 03/074679 



PCT/US03/06598 



5 sequence. Amino acids at variable positions for Example 16 design calculations are shown in bold in 
the 1CE1 and the germ line sequences. 

Figures 41a and 41b. Campath sequence-guided affinity maturation. Figure 41a shows the results of 
the computational screening calculations described in Example 16. Rows 1 through 3 list the light (L) 

10 or heavy (H) chain variable positions, as defined in the 1CE1 structure, and the according to the 

Chothia numbering scheme. Row 4 lists the amino acids considered at variable positions as obtained 
from Figures 40a and 40b, and row 5 lists the amino acid at each position in the WT Campath 
sequence. "Air or "All 20" means that all 20 amino acids are considered at the variable position. The 
rows that follow list the amino acid identity at variable positions for the lowest energy sequence from 

15 each cluster group, as described in Example 16. Figure 41b is similar to Figure 41a except that all the 
listed sequences are the set of sequences make up cluster groups 4 and 9. 

Figure 42. D3H44 affinity maturation. The large central figure shows the UPS V H and V L domains as 
gray ribbons bound to the tissue factor antigen shown as black ribbon, with Example 16 variable 
20 position residues represented as black lines. The smaller figure in the upper left shows the modeled 
full-length antibody structure with the relevant region highlighted by a box. 

Figures 43a, 43b # 43c, and 43d. D3H44 affinity maturation. Figures 43a and 43b show the results of 
the computational screening calculations using the UPS template and 1JPT template respectively, as 

25 described in Example 17. Column 1 lists the light (L) and heavy (H) chain variable positions. Column 
2 lists the amino acids considered at each variable position. The set of amino acids belonging to the 
Core, Surface, and Boundary classifications are described in the section entitled "Selection of Amino 
Acids to be Considered at Each Position". Column 3 lists the WT D3H44 amino acid identity at each 
variable position. Column 4 lists the amino acid identity at each variable position in the DEE ground 

30 state sequence predicted by the computational screening calculations. Column 5 lists the set of 

amino acids at each variable position that are observed in the Monte Carlo output. Each amino acid 
is followed by its occupancy, that is the number of sequences in the 1000 sequence set that contain 
that amino acid at that variable position. Figures 43c and 43d show an experimental library derived 
from the computational screening results, as described in Example 17. In Figure 43c, column 1 lists 

35 variable positions, and columns 2 and 3 show amino acid substitutions, which are included in the 

experimental library. In Figure 43d, column 1 lists variable positions, and column 2 shows amino acid 
substitutions that are included in the experimental library. The libraries are represented 
combinatorially, that is the explicit library is the combination of each possible amino acid substitution 
at each variable position with all other possible amino acid substitutions at all other positions. The 

40 complexity of the libraries, that is the total number of defined sequences of which it is composed, is 
shown in the bottom row. 

Figure 44. Herceptin affinity maturation. The large central figure shows the 1FVC V H and V L domains 
as black and gray ribbons respectively, with Example 18 variable position residues represented as 
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5 black lines. The smaller figure in the upper left shows the modeled full-length antibody structure with 
the relevant region highlighted by a box. 

Figures 45a and 45b. Herceptin affinity maturation. Figure 45a shows the results of the 
computational screening calculations described in Example 18. Column 1 lists the light (L) and heavy 

10 (H) chain variable positions. Column 2 lists the amino acids considered at each variable position. 
The set of amino acids belonging to the Core, Surface, and Boundary classifications are described in 
the section entitled "Selection of Amino Acids to be Considered at Each Position". Column 3 lists the 
WT Herceptin amino acid identity at each variable position. Column 4 lists the amino acid identity at 
each variable position in the DEE ground state sequence predicted by the computational screening 

1 5 calculations. Column 5 lists the set of amino acids at each variable position that are observed in the 
Monte Carlo output. Each amino acid is followed by its occupancy, that is the number of sequences 
in the 1000 sequence set that contain that amino acid at that variable position. Figure 45b shows an 
experimental library derived from the computational screening results, as described in Example 18. 
Column 1 lists variable positions, and column 2 shows amino acid substitutions that are included in 

20 the experimental library. The libraries are represented combinatorially, that is the explicit library is the 
combination of each possible amino acid substitution at each variable position with all other possible 
amino acid substitutions at all other positions. The complexity of the libraries, that Is the total number 
of defined sequences of which it is composed, is shown in the bottom row. 

25 DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to the use of a variety of computational methods to alter physico- 
chemical properties of antibodies, to allow the virtual screening of large numbers of potential variants 
to arrive at sets that exhibit desirable properties as compared to the starting antibody or antibodies. 
30 The computational analyses can be done as a single step, with the resulting set being experimentally 
generated and tested in the desired assay, for improved function and properties. Similarly, the 
original set can be additionally computationally manipulated to create a new library which then itself 
can be experimentally tested. 

35 The invention finds use in the prescreening of variant antibody libraries; that is, computational 

screening for stability (or other properties) may be done on either the entire protein or some subset of 
residues, as desired and described below. By using computational methods to generate a threshold or 
cutoff to eliminate disfavored sequences, the percentage of useful variants in a given variant set size 
can increase, and the required experimental outlay is decreased. 

40 

In order that the invention may be more completely understood, several definitions are set forth below. 
By "affinity maturation" herein is meant the process of enhancing the affinity of an antibody for its 
antigen. Methods for affinity maturation include but are not limited to computational screening 
methods and experimental methods. By " antibody" herein is meant a protein consisting of one or 



18 



WO 03/074679 



PCT/US03/06598 



more polypeptides substantially encoded (defined below) by all or part of the recognized antibody 
genes. The recognized immunoglobulin genes include, but are not limited to, the kappa, lambda, 
alpha, gamma (lgG1, lgG2, lgG3, and lgG4), delta, epsilon and mu constant region genes, as well as 
the myriad immunoglobulin variable region genes. Antibody herein is meant to include full-length 
antibodies and antibody fragments, and include antibodies that exist naturally in any organism or are 
engineered (e.g. are variants). By "antibody fragment" is meant any form of an antibody other than 
the full-length form. Antibody fragments herein include antibodies that are smaller components that 
exist within full-length antibodies, and antibodies that have been engineered. Antibody fragments 
include but are not limited to Fv, Fc, Fab, and (Fab') 2l single chain Fv (scFv), diabodies, triabodies, 
tetrabodies, bifunctional hybrid antibodies, and the like (Maynard & Georglou, 2000, Annu. Rev. 
Biomed. Eng. 2:339-76; Hudson, 1998, Cum Opin. Biotechnoi. 9:395-402). By "amino acid" and 
"amino acid identity" as used herein is meant one of the 20 naturally occurring or any non-natural 
analogues that may be present at a specific, defined position. By "computational screening method" 
herein is meant any method for designing one or more mutations in a protein, wherein said method 
utilizes a computer to evaluate the energies of the interactions of potential amino acid side chain 
substitutions with each other and/or with the rest of the protein. By "experimental library" herein is 
meant a list of one or more protein variants, existing either as a list of amino acid sequences or a list 
of the nucleotides sequences encoding them. Description of an experimental library may be defined, 
meaning that variant sequences are expressly described. Description of an experimental library may 
also be combinatorial, meaning that possible amino acid identities are indicated at variable positions, 
and the combination of ail possibilities at all variable positions results in an expanded, explicitly 
defined library. By "Fc tt herein is meant the polypeptides of an antibody that are comprised of 
immunoglobulin domains Cgamma2 and Cgamma3 (Cy2 and Cy3). Fc may also include any residues 
which exist in the N-terminal hinge between Cy2 and Cgammal (Cy1). These regions are shown in 
Figure 1 . Fc may refer to this region in isolation, or this region in the context of an antibody or 
antibody fragment. By "full-length antibody " herein is meant the structure that constitutes the natural 
biological form of an antibody. In most mammals, including humans, and mice, this form is a tetramer 
and consists of two identical pairs of two immunoglobulin chains, each pair having one light and one 
heavy chain, each light chain comprising immunoglobulin domains V L and C L , and each heavy chain 
comprising immunoglobulin domains V H , Cy1, Cy2, and Cy3. In each pair, the light and heavy chain 
variable regions (V L and V H ) are together responsible for binding to an antigen, and the constant 
regions (C L ,Cy1 , Cy2, and Cy3, particularly Cy2, and Cy3) are responsible for antibody effector 
functions. In some mammals, for example in camels and llamas, full-length antibodies may consist of 
only two heavy chains, each heavy chain comprising immunoglobulin domains V H , Cy2, and Cy3. By 
"immunoglobulin (lay herein is meant a protein consisting of one or more polypeptides substantially 
encoded by immunoglobulin genes. Immunoglobulins include but are not limited to antibodies. 
Immunoglobulins may have a number of structural forms, including but not limited to full-length 
antibodies, antibody fragments, and individual immunoglobulin domains including but not limited to 
Vh, Cy1 , Cy2, Cy3, V L , and C L . By "immunoglobulin (Ig) domain " herein is meant a protein domain 
consisting of a polypeptide substantially encoded by an immunoglobulin gene. Ig domains include but 
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are not limited to V H . Cy1 , Cy2, Cy3, V Ll and C L as is shown in Figure 1 . By "position" as used herein 
is meant a location in the sequence of a protein. Positions are typically, but not always, numbered 
sequentially. For example, position 297 is a position in the human antibody lgG1 . By "residue" as 
used herein is meant a position in a protein and its associated amino acid Identity. For example, 
Asparagine 297 (or Asn297 or N297) Is a residue in the human antibody lgG1 . By "variant protein 
sequence " as used herein is meant a protein sequence that has one or more residues that differ in 
amino acid identity from another similar protein sequence. Said similar protein sequence may be the 
natural wild type protein sequence, or another variant of the wild type sequence. In general, a starting 
sequence is referred to as a "parent" sequence, and again may either be a wild type or variant 
sequence. For example, preferred embodiments of the present invention may utilized humanized 
parent sequences upon which computational analyses are done. By "variable region" of an antibody 
herein is meant a polypeptide or polypeptides composed of the V H immunoglobulin domain, the V L 
immunoglobulin domains, or the V H and Vl immunoglobulin domains as is shown in Figure 1 
(including variants). Variable region may refer to this or these polypeptides in isolation, as an Fv 
fragment, as an scFv fragment, as this region in the context of a larger antibody fragment, or as this 
region in the context of a full-length antibody. 

The present invention may be applied to antibodies obtained from a wide range of sources. The 
antibody may be substantially encoded by an antibody gene or antibody genes from any organism, 
including but not limited to humans, mice, rats, rabbits, camels, llamas, dromedaries, monkeys, 
particularly mammals and particularly human and particularly mice and rats. In a preferred 
embodiment, the antibody is fully human, obtained for example using transgenic mice or other 
animals (Bruggemann & Taussig, 1997, Curr. Opin. BiotechnoL 8:455-458) or human antibody 
libraries coupled with selection methods (Griffiths & Duncan, 1998, Curr. Opin. BJotechnoL 9:102- 
108). The antibody does not necessarily need to be naturally occurring. For example the present* 
invention could be used to optimize an engineered antibody, including but not limited to chimeric 
antibodies and humanized antibodies (Clark, 2000, Immunol Today 21:397-402). In addition, the 
antibody being optimized may be an engineered variant of an antibody that is substantially encoded 
by one or more natural antibody genes. For example, in a one embodiment the antibody being 
optimized is an antibody that has been affinity matured. 

In general, the computationally generated antibody genes of the present invention are designed to be 
substantially encoded by a naturally occurring antibody gene such as a humanized antibody gene. 
"Substantially encoded" can include a number of components, including host cell codon usage and 
complementarity to wild type genes. For example, in one embodiment, "substantially encoded" can 
be defined as the ability of the computationally generated gene being sufficiently complementary to 
the wild type gene (or its complement, depending on sense and antisense considerations) such that 
hybridization can occur. This complementarity need not, and is preferably not perfect; that is, due to 
the alteration of the variable residues, there are a number of substitutions (and sometimes insertions 
or deletions) between the two sequences that result in differences between the sequences. However, 



20 



WO 03/074679 



PCT/US03/06598 



5 if the number of mutations is so great that no hybridization can occur under even the least stringent of 
hybridization conditions, the sequence is not a complementary sequence. Thus, by "substantially 
complementary herein is meant that the sequences are sufficiently complementary to each other to 
hybridize under the selected reaction conditions. High stringency conditions are known in the art; see 
for example Maniatis et al., Molecular Cloning: A Laboratory Manual, 2d Edition, 1989, and Short 

10 Protocols in Molecular Biology, ed. Ausubel, et al., both of which are hereby Incorporated by 
reference. Stringent conditions are sequence-dependent and will be different in different 
circumstances. Longer sequences hybridize specifically at higher temperatures. An extensive guide 
to the hybridization of nucleic acids is found in Tijssen, Techniques in Biochemistry and Molecular 
Biology-Hybridization with Nucleic Acid Probes, "Overview of principles of hybridization and the 

15 strategy of nucleic acid assays" (1993). Generally, stringent conditions are selected to be about 5- 
10 C lower than the thermal melting point (T m) for the specific sequence at a defined ionic strength 
pH. The Tm is the temperature (under defined ionic strength, pH and nucleic acid concentration) at 
which 50% of the probes complementary to the target hybridize to the target sequence at equilibrium 
(as the target sequences are present in excess, at Tm, 50% of the probes are occupied at 

20 equilibrium). Stringent conditions will be those in which the saJt concentration is less than about 1.0 M 
sodium ion, typically about 0.01 to 1 .0 M sodium ion concentration (or other salts) at pH 7.0 to 8.3 and 
the temperature is at least about 30 C for short probes (e.g. 10 to 50 nucleotides) and at least about 
60 C for long probes (e.g. greater than 50 nucleotides). Stringent conditions may also be achieved 
with the addition of destabilizing agents such as formamide. In another embodiment, less stringent 

25 hybridization conditions are used; for example, moderate or low stringency conditions may be used, 
as are known in the art; see Maniatis and Ausubel, supra, and Tijssen, supra. 

In another embodiment, "substantially encoded" means that at least a significant portion of the gene is 
identical to the parent gene such as a humanized or human antibody. In preferred embodiments, 
30 there are large areas of perfect complementarity punctuated by the variant positions which may be 
different. In preferred embodiments, at least 75% of the total gene is encoded by the parent gene, 
with at least 85%, 90%, 95% and 98% being preferred. 

The present invention may be applied to a wide range of antibody structural forms. For example, the 
35 antibody may be a full-length antibody, an antibody fragment, an Fc region, a variable region, an 
individual immunoglobulin domain, or a structural motif, site, or loop of an antibody. The antibody 
may comprise more than one protein chain. That is, the antibody may be an oligomer, including a 
homo- or hetero-oligomer. 

40 The present invention may be applied to a wide range of antibody products. In one embodiment the 
antibody product is a therapeutic, a diagnostic, or a research reagent. In a preferred embodiment the 
antibody product is a therapeutic antibody which may be used to treat disease, such diseases 
including, but not limited to cancer, autoimmune disease, cardiovascular disease, and the like. The 
antibody product may find use in a composition that is monoclonal or polyclonal, and that could be 
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5 injected intravenously, subcutaneously, intramuscularly, and the like, as well as inhaled, applied 
topically, or via an oral dosage form, or otherwise administered. In an alternate embodiment, the 
antibody product is a library that could be screened experimentally, for example to generate 
antibodies against a target antigen using a selection method as described herein, or to affinity mature 
a particular antibody. This library may be a theoretical library, that is a list of nucleic acid or amino 
1 0 acid sequences, or may be a physical library of nucleic acids or proteins that encode the library 
sequences. 

Computational Screening Methodology 

A three-dimensional structure of an antibody is used as the starting point of the computational 
15 screening method of the present invention. The positions to be optimized are identified, which may be 
the entire antibody sequence or subset(s) thereof. Amino acids that will be considered at each 
position are selected. In a preferred embodiment, each considered amino acid may be represented 
by a discrete set of allowed conformations, called rotamers. Interaction energies are calculated 
between each considered amino acid and 1) each other considered amino acid, and 2) the rest of the 
20 protein, including the protein backbone and invariable residues. In a preferred embodiment, 

interaction energies are calculated between each considered amino acid side chain rotamer and 1 ) 
each other considered amino acid side chain rotamer and 2) the rest of the protein, including the 
protein backbone and invariable residues. One or more combinatorial search algorithms are then 
used to identify the lowest energy sequence and/or low energy sequences that will comprise an 
25 experimental library. 

In a preferred embodiment, the computational screening method used to optimize antibodies is 
Protein Design Automation® (PDA™) technology, as is described in US 6,188,965; 6,269,312; and 
6,403,31 2; USSNs 09/782,004; 09/927,790; and 1 0/21 8,1 02r PCTs 98/07254; 01/40091 ; and 

30 02/25588, all of which are expressly incorporated herein by reference. In another preferred 

embodiment, a Sequence Prediction Algorithm (SPA) is used to design proteins that are compatible 
with a known protein backbone structure as is described in Raha, et a/., 2000, Protein Sci. 9:1 106- 
1119, USSNs 09/877,695 and 10/071,859, all expressly incorporated herein by reference. In some 
embodiments, combinations of different computational screening methods are used, including 

35 combinations of PDA™ and SPA, as well as combinations of these computational techniques in 

combination with sequence and structural alignment. Similarly, these computational methods can be 
used simultaneously or sequentially, in any order. Furthermore, these computational methods can be 
used with experimental methods (shuffling, error-prone PCR, etc.) as outlined below. It is also 
important to note that reiterative cycles are included; thus for example, a first computational step may 

40 be done, followed by some experimental techniques, followed by additional computational techniques. 

Computational screening, viewed broadly, has four steps: 1) selection and preparation of the 
antibody template or templates, 2) selection of variable positions and considered amino acids at those 
positions, and in a preferred embodiment selection of rotamers to model amino acids, 3) energy 



22 



WO 03/074679 



PCT/US03/06598 



5 calculation, and 4) combinatorial optimization. As will be appreciated by those skilled in the art, 
energy calculation and combinatorial optimization are the computationally intensive aspects of 
computational screening, and together these two steps are referred to as design calculations. 

Selection and Preparation of the Antibody Template 
10 By "template antibody" herein is meant the structural coordinates of part or all of an antibody to be 
optimized. The template antibody is used as input in the computational screening calculations. A 
template protein may be part or all of any protein that has a known structure or for which a structure 
may be calculated, estimated, modeled or determined experimentally. 

15 The template protein may be any antibody for which a three dimensional structure (that is, three 
dimensional coordinates for a set of the protein's atoms) is known or may be generated. The three 
dimensional structures of antibodies may be determined using methods including but not limited to X- 
ray crystallographic techniques, nuclear magnetic resonance (NMR) techniques, de novo modeling, 
and homology modeling. Antibody/antigen complexes may also be obtained using docking methods. 

20 Suitable antibody structures include, but are not limited to, all of those found in the Protein Data Base 
compiled and serviced by the Research Collaborator for Structural Bioinfomnatics (RCSB, formerly 
the Brookhaven National Lab). 

As will be appreciated by those skilled in the art, antibodies are a family of proteins that are closely 
25 related in sequence and structure. Consequently, homology models, which are generated using 

available sequence and structure information from other antibodies, are often of high quality. Thus, if 
optimization is desired for an antibody for which the structure has not been solved experimentally, a 
suitable structural model may be generated that may serve as the template for design calculations. 
Methods for generating homology models are known in the art. Methods for generating homology 
30 models of proteins are known in the art, and these methods find use in the present invention. See for 
example, Luo, ef a/. 2002, Protein ScL 1 1 : 121 8-1226, Lehmann & Wyss, 2001 , Curr. Opin. 
Biotechnol. 12(4):371-5.; Lehmann ef a/., 2000, Biochim BiophysActa. 1543(2):408-415; Rath & 
Davidson, 2000, Protein Sc/., 9(12):2457-69; Lehmann ef a/., 2000, Protein Eng.13(1):49-57; 
Desjarlais & Berg, 1993, Proc Natl Acad Sci USA. 90(6):2256-60; Desjarlais & Berg, 1992, Proteins. 
35 12(2): 101 -4; Henikoff & Henikoff, 2000, Adv. Protein Chem. 54:73-97; Henikoff & Henikoff, 1994, J. 
MoL Biol. 243(4):574-8; all herein expressly incorporated by reference. Methods for generating 
homology models of antibodies in particular are described in Morea ef a/., 2000, Methods 20:267-269, 
all herein expressly incorporated by reference, 

40 As discussed above, the template may comprise any of a number of antibody structural forms. The 
template used in antibody design calculations may comprise an entire full-length antibody, a subset of 
an antibody such as a fragment, an individual immunoglobulin domain, or a structural motif, site, or 
loop of an antibody. The template antibody may comprise more than one protein chain, and may be 
the complex of an antibody bound to its antigen or to an antibody receptor. The template may 
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5 additionally contain nonprotein components, including but not limited to small molecules, substrates, 
cofactors, metals, water molecules, prosthetic groups, polymers and carbohydrates. As will be 
appreciated by those in the art, the target antigen of an antibody may be a protein or a non-protein 
molecule. In a preferred embodiment, the structural template is a plurality or set of template proteins, 
for example or an ensemble of structures such as those obtained from NMR. Alternatively, the set of 
10 antibody templates is generated from a set of related proteins or structures, or artificially created 
ensembles. 

The protein template may be modified or altered prior to design calculations. A variety of methods for 
template preparation are described in US 6,188,965; 6,269,312; and 6,403,312; USSNs 09/782,004; 

1 5 09/927,790; 09/877,695; 1 0/071 ,859 and 10/218,102; PCTs 98/07254; 01/40091 ; and 02/25588, all of 
which are herein expressly incorporated by reference. For example, in a preferred embodiment, 
explicit hydrogens may be added if not included within the structure. In a preferred embodiment, 
energy minimization of the structure is run to relax strain, including strain due to van der Waals 
clashes, unfavorable bond angles, and unfavorable bond lengths. Alternatively, the protein template 

20 is altered using other methods, such as manually, including directed or random perturbations. It is 
also possible to modify the protein template during later steps of a design calculation, including during 
the energy calculation and combinatorial optimization steps. In an alternate embodiment, the protein 
template is not modified before or during design calculations. 

25 Selection of Variable Positions and Considered Amino Acids 
Selection of Variable, Floated, and Fixed Positions 

As is known in the art, it may be beneficial to reduce the complexity of a calculation by allowing 
mutation only at certain variable positions. By "variable position" herein is meant a position at which 
the amino acid identity is allowed to be altered in a design calculation. In a preferred embodiment the 
30 amino acid identity to which a position may be mutated is the full set or a subset of the 20 naturally 
occurring amino acids. Alternatively, variable positions may be allowed to mutate to a set of non- 
naturally occurring amino acids or synthetic analogs. One or more residues may be variable positions 
in design calculations. 

35 Residues that are chosen as variable positions may be those that contribute to or are hypothesized to 
contribute to the antibody property to be optimized. For the present invention, these properties 
include stability, solubility, and affinity for antigen. Residues at variable positions may contribute 
favorably or unfavorably to a specific antibody property. For example, a residue at the antibody/ 
antigen interface may be involved in mediating binding with antigen, and thus this position may be 

40 varied in design calculations aimed at improving affinity with antigen. Alternatively, as another 

example, a residue which has an exposed hydrophobic side chain may be responsible for causing 
unfavorable aggregation, and thus this position may be varied in design calculations aimed a 
improving solubility. 
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5 Thus in one embodiment, variable positions may be those positions that are directly involved in 

interactions that are determinants of an antibody property. For example, the antigen binding site of an 

antibody may be defined to Include all residues that contact antigen. By "contact* herein is meant 

some chemical interaction between at least one atom of an antibody residue with at least one atom of 

« 

the bound antigen, with chemical interaction including, but not limited to van der Waals interactions, 
10 hydrogen bond interactions, electrostatic interactions, and hydrophobic interactions. In an alternative 
embodiment, variable positions may include those positions that are indirectly involved in an antibody 
property, i.e. such positions may be proximal to residues that contribute to an antibody property. For 
example, the antigen binding site of an antibody may be defined to include all residues within a certain 
distance, for example 4 - 10 A, of the residues that are in van der Waals contact with antigen. Thus 
15 variable positions in this case may be chosen not only as residues that directly contact antigen, but 
also those that contact residues that contact antigen and thus influence antigen binding indirectly. 
The specific positions chosen are dependent on the design strategy being employed. 

In a preferred embodiment, some of the residue positions that are not variable are floated. By "floated 
20 position" herein is meant a position at which the amino acid conformation but not the amino acid 

identity is allowed to vary in a protein design calculation. In one embodiment the floated position may 
have the wild type amino acid identity. For example, floated positions may be wild type positions that 
are within a small distance of, for example, 5 A, of a variable position residue. In an alternate 
embodiment, a floated position may have a non-wild type amino acid identity. Such an embodiment 
25 may find use in the present invention, for example, when the goal is to evaluate the energetic or 
structural outcome of a specific mutation. 

Residue positions that are not variable or floated are fixed. By Tixed position" herein Is meant a 
position at which the amino acid identity and the conformation are held constant in a protein design 

30 calculation. Residues, which may be fixed, may include residues that are not involved or not thought 
to be involved in the property to be optimized. In this case there is nothing to be gained by varying 
these positions. Residues that may be fixed may also include but are not limited to residues that are 
important for maintaining proper folding, structure, stability, solubility, and biological function. For 
example, residues that interact with protein receptors or residues that are glycosylation sites may be 

35 fixed In design calculations to ensure that receptor binding and proper glycosylation respectively are 
not perturbed. Likewise, if stability is being optimized, it may be beneficial to fix residues that directly 
or indirectly interact with antigen so that antigen binding is not perturbed. Fixed positions may also 
include structurally important residues such as cysteines participating in disulfide bridges, residues 
critical for backbone conformation such as proline or glycine, critical hydrogen bonding residues, and 

40 residues that form favorable packing interactions. 

Selection of Amino Acids to be Considered at Each Position 

The next step In the computational screening method of the present invention is to select a set of 
possible amino acid identities that will be considered at each particular variable position. This set of 
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5 possible amino acids is herein referred to as "considered amino acids" at a variable position. In one 
embodiment, ail 20 amino acids (or their analogues or synthetic amino adds) are considered at a 
given variable position. Alternatively, a subset of amino acids, or even only one amino acid is 
considered at a given variable position. As will be appreciated by those skilled in the art, there is a 
computational benefit to considering only certain amino acid identities at variable positions, as it 

10 decreases the combinatorial complexity of the search. Furthermore, considering only certain amino 
acids at variable positions may be used to tune calculations toward specific design strategies. For 
example, for solubility optimization, it may be beneficial to allow only polar amino acids to be 
considered at surface exposed variable positions. In a preferred embodiment for solubility, at least 
one antibody sequence possesses an increase in polar character. Alternatively preferred, is selecting 

1 5 at least one nonpolar amino acid and substituting said nonpolar amino acid with a polar amino acid. 

A wide variety of methods may be used, alone or in combination, to select which amino acids will be 
considered at each position, including but not limited to those discussed below. 

20 For example, as is known in the art, the set of amino acids allowed at variable positions may be 

chosen based on the degree of exposure to solvent. Hydrophobic or nonpolar amino acids typically 
reside in the interior or core of a protein, which are inaccessible or nearly inaccessible to solvent. 
Thus at variable core positions it may be beneficial to consider only or mostly nonpolar amino acids 
such as alanine, valine, isoleucine, leucine, phenylalanine, tyrosine, tryptophan, and methionine. 

25 Hydrophilic or polar amino acids typically reside on the exterior or surface of proteins, which have a 
significant degree of solvent accessibility. Thus at variable surface positions It may be beneficial to 
consider only or mostly polar amino acids such as alanine, serine, threonine, aspartic acid, 
asparagine, glutamlne, glutamic acid, arginine, lysine and histidine. Some positions are partly 
exposed and partly buried, and are not clearly protein core or surface positions, in a sense serving as 

30 boundary residues between core and surface residues. Thus at such variable boundary positions it 
may be beneficial to consider both nonpolar and polar amino acids such as alanine, serine, threonine, 
aspartic acid, asparagine, glutamine, glutamic acid, arginine, lysine histidine, valine, isoleucine, 
leucine, phenylalanine, tyrosine, tryptophan, and methionine. 

35 Determination of the degree of solvent exposure at variable positions may be by subjective evaluation 
or visual inspection of the antibody template by one skilled in the art of protein structural biology, or by 
the use of a variety of algorithms that are known in the art. Selection of amino acid types to be 
considered at variable positions may be aided or determined wholly by computational methods, such 
as calculation of solvent accessible surface area, or using algorithms which assess the orientation of 

40 the Calpha-Cbeta vectors relative to a solvent accessible surface, as outlined in US 6,188,965; 
6,269,312; and 6,403,312; USSNs 09/782,004; 09/927,790; and 10/218,102; PCTs 98/07254; 
01/40091 ; and 02/25588, and expressly herein incorporated by reference. In an embodiment, each 
variable position may be classified explicitly as a core, surface, or boundary position. 
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5 In an alternate embodiment, selection of the set of amino acids allowed at variable positions may be 
hypothesis-driven. Hypotheses for which amino acid types should be considered at variable positions 
may be derived by a subjective evaluation or visual Inspection of the antibody template by one skilled 
in the art of protein structural biology. For example, if it is suspected that a hydrogen bonding 
interaction may be favorable at a variable position, polar residues that have the capacity to form 

10 hydrogen bonds may be considered even if the position is in the core. Likewise, if it is suspected that 
a hydrophobic packing interaction may be favorable at a variable position, nonpolar residues that 
have the capacity to form favorable packing interactions may be considered even if the position is on 
the surface. Other examples of hypothesis-driven approaches may involve issues of backbone 
flexibility or protein fold. As is known in the art, certain residues, for example proline, glycine, and 

1 5 cysteine, play Important roles in protein structure and stability. Glycine enables greater backbone 
flexibility than all other amino acids, proline constrains the backbone more than all other amino acids, 
and cysteines may form disulfide bonds. It may therefore be beneficial to include one or more of 
these amino acid types to achieve a desired goal. Alternatively, it may be beneficial to exclude one or 
more of these amino acid types from the list of considered amino acids. 

20 

In an alternate embodiment, subsets of amino acids may be chosen to maximize coverage. In this 
case, additional amino acids with properties similar to that in the antibody template may be 
considered at variable positions. For example, if the residue at a variable position in the antibody 
template is a large hydrophobic residue, the user may choose to include additional large hydrophobic 
25 amino acids at that position. Alternatively, subsets of amino acids may be chosen to maximize 

diversity. In this case, amino acids with properties dissimilar to those in the antibody template may be 
considered at variable positions. For example, If the residue at a variable position in the antibody 
template is a large hydrophobic residue, the user may choose to include only one large hydrophobic 
amino acid in combination with other amino acids that are small, polar, etc. 

30 
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5 Selection of Rotamers to Model Amino Acids 

As is known in the art, some computational screening methods require only the identity of considered 
amino acids to be determined during design calculations. That is, no information is required 
concerning the conformations or possible conformations of the amino acid side chains. As is also 
known in the art, and in a preferred embodiment, a set of discrete side chain conformations, called 

10 rotamers, can be considered for each amino acid. Thus, a set of rotamers will be considered at each 
variable and floated position. Rotamers may be obtained from published rotamer libraries (see for 
example, Lovel et a/., 2000, Proteins: Structure Function and Genetics 40:389-408; Dunbrack & 
Cohen, 1997, Protein Science 6:1661-1681; DeMaeyer ef a/., 1997, Folding and Design 2:53-66; 
Tuffery et al. % 1991, J. BiomoL Struct. Dyn. 8:1267-1289, Ponder & Richards, 1987, J. Mot. Biol. 

15 1 93:775-791 ). As is known in the art, rotamer libraries may be backbone-independent or backbone- 
dependent. Rotamers may also be obtained from molecular mechanics or ab initio calculations, and 
using other methods. In a preferred embodiment, a flexible rotamer model is used (see Mendes et 
a/., 1999, Proteins: Structure, Function, and Genetics 37:530-543). Similarly, artificially generated 
rotamers may be used, or augment the set chosen for each amino acid and/or variable position. In a 

20 preferred embodiment, at least one conformation that is not low in energy is included in the list of 

rotamers. In an alternatively preferred embodiment, the rotamer of the variable position residue in the 
antibody template is included in the list of rotamers allowed for that variable position in the design 
calculation. In an alternative embodiment, only the identity of each amino acid considered at variable 
positions is provided, and no specific conformational states of each amino acid are used during 

25 design calculations. That Is, use of rotamers is not essential for computational screening. 

Use of Experimental Information 

In one embodiment of the present invention, experimental information may be used to guide the 
choice of variable positions, and/or the choice of considered amino acids at variable positions. As is 
30 known in the art, mutagenesis experiments are often carried out to determine the role of certain 
residues in protein structure and function, for example, which protein residues play a role in 
determining stability, or which residues make up the antigen binding site of an antibody. Data 
obtained from such experiments are useful in the present invention. 

35 For example, variable positions for affinity maturation calculation could involve varying all positions at 
which mutation has been shown to affect binding. Similarly, the results from such an experiment may 
be used to guide the choice of allowed amino acid types at variable positions. For example, if certain 
types of amino acid substitutions are found to be favorable, sets, subsets, and/or similar types of 
those amino acids may be chosen to maximize coverage. In one embodiment, additional amino acids 

40 with properties similar to that or those that were found to be favorable experimentally may be 

considered at variable positions. For example, if experimental mutation of a variable position residue 
at the antigen interface to a large hydrophobic residue was found to be favorable, the user may 
choose to include additional large hydrophobic amino acids at that position in the computational 
screen. 
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5 

As is known in the art, display and other selection technologies may be coupled with random 
mutagenesis to generate a list or lists of amino acid substitutions that are favorable for the selected 
property. Such a list or lists obtained from such experimental work find use in the present invention. 
For example, positions that are found to be invariableln such an experiment may be excluded as 

10 variable positions in computational screening calculations, whereas positions that are found to be 
more acceptable to mutation or respond favorably to mutation may be chosen as variable positions. 
Similarly, the results from such experiments may be used to guide the choice of allowed amino acid 
types at variable positions. For example, if certain types of amino acids arise more frequently in an 
experimental selection, subsets or similar types of those amino acids may be chosen to maximize 

15 coverage. In one embodiment, additional amino acids with properties similar to those that were found 
to be favorable experimentally may be considered at variable positions. For example, ff selected 
mutations at a variable position that resides at the antigen interface are found to be uncharged polar 
amino acids, the user may choose to include additional uncharged polar amino acids, or perhaps 
charged polar amino ac\ds t at that position. 

20 

Use of Sequence information 

In one embodiment of the present invention, sequence information may be used to guide choice of 
variable positions, and/or the choice of amino acids considered at variable positions. As Is known in 
the art, all antibodies share a common structural scaffold and are homologous in sequence. 

25 Furthermore, there is a large amount of sequence and structural information available for the antibody 
family of proteins. These favorable aspects of antibodies may be used to gain insight into particular 
positions in the antibody family. As is known in the art, sequence alignments are often carried out to 
determine which antibody residues are conserved and which are not conserved. That is to say, by 
comparing and contrasting alignments of-antibody sequences, the degree of variability at a position 

30 may be observed, and the types of amino acids that occur naturally at positions may be observed. 
Data obtained from such analyses are useful in the present invention. 

The benefit of using sequence information to choose variable positions and considered amino acids at 
variable positions are several fold. For choice of variable positions, the primary advantage of using 

35 sequence information is that insight may be gained into which positions are more tolerant and which 
are less tolerant to mutation. Thus sequence information may aid in ensuring that quality diversity, i.e. 
mutations that are not deleterious to protein structure, stability, etc., is sampled computationally. The 
same advantage applies to use of sequence Information to select amino acid types considered at 
variable positions. That is, the set of amino acids which occur in an antibody sequence alignment 

40 may be thought of as being pre-screened by evolution to have a higher chance than random for being 
compatible with an antibody's structure, stability, solubility, function, etc. Thus higher quality diversity 
is sampled computationally. A second benefit of using sequence information to select amino acid 
types considered at variable positions is that certain alignments may represent sequences that may 
be less Immunogenic than random sequences. For example, if the amino acids considered at a given 



29 



WO 03/074679 



PCT/US03/06598 



5 variable position are the set of amino acids which occur at that position in an alignment of human 
germ line antibody sequences, those amino acids may be thought of as being pre-screened by nature 
for generating no or low immune response if the optimized antibody is used as a human therapeutic. 

The source of the sequences may vary widely, and include one or more of the known databases, 

1 0 including but not limited to the Kabat database (.immuno.bme.nwu.edu; Johnson & Wu, 2001 , Nucleic 
Acids Res. 29:205-206; Johnson & Wu, 2000, Nucleic Acids Res. 28:214-218), the IMGT database 
(IMGT, the international ImMunoGeneTics information system®; imgt.cines.fr; Lefranc et a/., 1999, 
Nucleic Acids Res. 27:209-212; Ruiz ef a/., 2000 Nucleic Acids Res. 28:219-221 ; Lefranc et a/., 2001, 
Nucleic Acids Res. 29:207-209; Lefranc ef a/., 2003, Nucleic Acids Res. 31:307-310), and VBASE 

1 5 (.mrc-cpe.cam.ac.uk/vbase-ok.php?menu=901 ). Antibody sequence information can be obtained, 
compiled, and/or generated from sequence alignments of germ line sequences or sequences of 
naturally occurring antibodies from any organism, including but not limited to mammals. For example, 
Figures 2a and 2b list the aligned human V H and V L kappa germ line sequences, along with several 
antibody variable region sequences relevant to the examples of the present invention. Alternatively, 

20 antibody sequence information can be obtained from a database that is compiled privately. Other 
databases which are more general nucleic acid or protein databases, i.e. not particular to antibodies, 
for example including but are not limited to SwissProt (expasy.ch/sprot/), GenBank 
(ncbi.nlm.nih.gov/Genbank) and Entrez (ncbi.nlm.nih.gov/Entrez/). and EMBL Nucleotide Sequence 
Database (ebi.ac.uk/embl/), may find use in the present invention. There are numerous sequence- 

25 based alignment programs and methods known in the art, and ail of these find use in the present 
invention for generation of antibody sequence alignments. 

Once alignments are made, sequence information can be used to guide choice of variable positions. 
Such sequence information can relate the variability, natural or otherwise, of a given position. 

30 Variability herein should be distinguished from variable position. By "variability" herein is meant the 
degree to which a given position in a sequence alignment shows variation in the types of amino acids 
that occur there. Variable position, to reiterate, is a position chosen by the user to vary in amino acid 
identity during a computational screening calculation. Variability may be determined qualitatively by 
one skilled in the art of bioinformatics. There are also methods known in the art to quantitatively 

35 determine variability that may find use in the present invention. The most preferred embodiment 
measures Information Entropy or Shannon Entropy. Variable positions can be chosen based on 
sequence information obtained from closely related antibody sequences, or antibody sequences that 
are less closely related. 

40 The use of sequence information to choose variable positions finds broad use in the present 
invention. For example, to optimize antibody solubility by replacing exposed nonpolar surface 
residues, variable positions may be chosen as only that set of surface exposed positions that show a 
certain level of variability. As another example, to optimize antibody stability by mutating interdomain 
interface residues, variable positions may be chosen as only that set of interface positions that shown 
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5 a certain level of variability. For example, if an interface position in the antibody template is 

tryptophan, and tryptophan is observed at that position in greater than 90% of the sequences in an 
alignment, it may be beneficial to leave that position fixed. In contrast, if another Interface position is 
found to have a greater level of variability, for example if five different amino acids are observed at 
that position with frequencies of approximately 20% each, that position may be chosen as a variable 

10 position. In another embodiment, variable positions for affinity maturation calculations could be 
chosen to be all positions or a subset of positions which are determined by sequence alignment to 
make up a complementarity determining region (CDR) loop. Alternatively, variable positions could be 
chosen to be those residues that are determined by sequence alignment to contact a CDR loop. 
Thus, visual inspection of an aligned antibody sequence may substitute for visual inspection of an 

15 antibody structure. This Is due to the high level of both sequence and structural similarity in the 
antibody family. The rationale here is that those positions which typically contact a CDR in most 
antibody structures, for example, are hypothesized to be positions which contact a CDR in the 
antibody template being optimized in the calculation. 

20 Sequence information can also be used to guide the choice of amino acids considered at variable 
positions. Such sequence information can relate to how frequently an amino acid, amino acids, or 
amino acid types (for example polar or nonpolar, charged or uncharged) occur, naturally or otherwise, 
at a given position. In one embodiment, the set of amino acids considered at a variable position in 
design calculations may comprise the set of amino acids that is observed at that position in the 

25 alignment. Thus, the position-specific alignment information is used directly to generate the list of 
considered amino acids at a variable position in a computational screening calculation. Such a 
strategy is well known in the art. See for example Lehmann & Wyss, 2001, Ourr. Opin. Biotechnol. 
12(4):371-5.; Lehmann et a/., 2000, Biochim Biophys Acta. 1543(2):408-415; Rath & Davidson, 2000, 
Protein Sc/., 9(12):2457-69; Lehmann ef a/., 2000, Protein Eng. 1 3(1 ):49-57; Desjartais & Berg, 1993, 

30 Proc Natl Acad Sci USA, 90(6):225&-60; Desjarlais & Berg, 1 992, Proteins. 1 2(2):1 01 -4; Henikoff & 
Henikoff, 2000, Adv. Protein Chem. 54:73-97; Henikoff & Henikoff, 1994, J. Mol. Biol. 243(4):574-8; all 
herein expressly incorporated by reference. 

In an alternate embodiment, the set of amino acids considered at a variable position or positions may 
35 comprise a set of amino acids that is observed most frequently in the alignment. Thus, a certain 
criteria is applied to determine whether the frequency of an amino acid or amino acid type will be 
included in the set of amino acids that are considered at a variable position in a design calculation. 
As is known in the art, sequence alignments may be analyzed using statistical methods to calculate 
the sequence diversity at any position in the alignment and the occurrence frequency or probability of 
40 each amino acid at a position. Such data may then be used to determine which amino acids types to 
consider. In the simplest embodiment, these occurrence frequencies are calculated by counting the 
number of times an amino acid is observed at an alignment position, then dividing by the total number 
of sequences in the alignment. In other embodiments, the contribution of each sequence, position or 
amino acid to the counting procedure is weighted by a variety of possible mechanisms. In a preferred 
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5 embodiment, the contribution of each aligned sequence to the frequency statistics is weighted 

according to its diversity weighting relative to other sequences in the alignment A common strategy 
for accomplishing this is the sequence weighting system recommended by Henikoff and Henikoff (see 
Henikoff & Henikoff, 2000, Adv. Protein Chem. 54:73-97; Henikoff & Henikoff, 1994, J. Mot. Biol. 
243:574-8; both herein expressly incorporated by reference. In a preferred embodiment, the 

10 contribution of each sequence to the statistics is dependent on its extent of similarity to the target 

sequence, i.e. the antibody template used in the design calculations, such that sequences with higher 
similarity to the target sequence are weighted more highly. Examples of similarity measures include, 
but are not limited to, sequence identity, BLOSUM similarity score, PAM matrix similarity score, and 
Blast score. In an alternate embodiment, the contribution of each sequence to the statistics is 

15 dependent on its known physical or functional properties. These properties include, but are not 
limited to, thermal and chemical stability, contribution to activity, solubility, etc. For example, when 
optimizing an antibody for solubility, those sequences in an alignment that are known to be most 
soluble (for example see Ewert et a/., 2003, J. Mol.Biol 325:531-553), will contribute more heavily to 
the calculated frequencies. 

20 

Regardless of what criteria are applied for choosing the set of amino acids in a sequence alignment to 
be considered at variable positions, using sequence information to choose the set of amino acids 
considered at variable positions finds broad use in the present invention. For example, to optimize 
antibody solubility by replacing exposed nonpolar surface residues, considered amino acids may be 

25 chosen as the set of amino acids, or a subset of those amino acids which meet some criteria, that are 
observed at that position in an alignment of antibody sequences. As another example, to optimize 
antibody stability by mutating domain interface residues, considered amino acids may be chosen as 
the set of amino acids, or a subset of those amino acids that meet some criteria, that are observed at 
that position in an alignment of antibody sequences. In an alternate embodiment, one or more amino 

30 acids may be added or subtracted subjectively from a list of amino acids derived from a sequence 
alignment in order to maximize coverage. For example, additional amino acids with properties similar 
to those that are found in a sequence alignment may be considered at variable positions. For 
example, if an antigen binding position is observed to have uncharged polar amino acids in an 
antibody sequence alignment, the user may choose to include additional uncharged polar amino acids 

35 in an affinity maturation calculation, or perhaps charged polar amino acids, at that position. 

In a preferred embodiment, sequence alignment is not used alone in the analysis step of the present 
invention; that is, sequence information is combined with energy calculation, as discussed below. For 
example, pseudo energies can be derived from sequence information to generate a scoring function. 
40 The use of a sequence-based scoring function may assist in significantly reducing the complexity of a 
calculation. However, as is appreciated by those skilled in the art, the use of a sequence-based 
scoring function alone may be inadequate because sequence information can often indicate 
misleading correlations between mutations that may in reality be structurally conflicting. Thus, in a 
preferred embodiment, a structure-based method of energy calculation is used, either alone or in 

■ 
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5 combination with a sequence-based scoring function. That is, preferred embodiments do not rely on 
sequence alignment Information alone as the analysis step. 

Energy Calculation 

Some method of scoring each amino acid substitution, herein referred to as energy calculation, is 
10 required for computational screening. As previously discussed, there are a variety of ways to 
represent amino acids in order to enable efficient energy calculation. 

In a preferred embodiment, considered amino acids are represented as rotamers, as described 
previously, and the energy (or score) of interaction of each possible rotamer at each variable position, 

15 or at each variable and floated position, with the template and/or other rotamers, is calculated. It 
should be understood that the template in this case includes both the atoms of the protein structure 
backbone, as well as the atoms of any fixed residues, as well as non-protein atoms. In a preferred 
embodiment, two sets of interaction energies are calculated for each side chain rotamer at every 
position: the interaction energy between the rotamer and the template (the "singles" energy), and the 

20 interaction energy between the rotamer and all other possible rotamers at every other variable and 
floated position (the "doubles" energy). In an alternate embodiment, singles and doubles energies are 
calculated for fixed positions as well as for variable and floated positions. 

In an alternate embodiment, considered amino acids are not represented as rotamers. 

25 

In one embodiment, molecular dynamics calculations may be used to computationally screen 
sequences by individually calculating mutant sequence scores. 

Regardless of how amino acids are represented, the energies of interaction are measured by one or 
30 more scoring functions. A variety of scoring functions find use in the present invention for calculating 
energies. As will be appreciated by those skilled in the art, certain scoring functions are more 
compatible with certain types of methods for representing amino acids. For example, force fields are 
particularly well suited to score amino acid substitutions that are represented as rotamers. However, 
in order to not constrain the present invention to any particular application or theory of operation, a 
35 variety of scoring functions are presented that may find use in the present invention regardless of how 
amino acids are represented. 

Scoring functions may include a number of potentials, herein referred to as the energy terms of a 
scoring function, including but are not limited to, a van der Waals potential scoring function, a 
40 hydrogen bond potential scoring function, an atomic solvation potential scoring function, a secondary 
structure propensity potential scoring function and an electrostatic potential scoring function. At least 
one energy term is used to score each variable or floated position, although the energy terms may 
differ depending on the position classification or other considerations. 
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5 A variety of scoring functions are described in US 6,188,965; 6,269,312; and 6,403,312; USSNs 
09/782,004; 09/927,790; 09/877,695; 10/071,859 and 10/218,102; PCTs 98/07254; 01/40091; and 
02/25588, all of which are herein expressly incorporated by reference. As will be appreciated by 
those skilled in the art, a number of force fields, which are comprised of one or more energy terms, 
may serve as scoring functions. Force fields include, but are not limited to, ab initio or quantum 

10 mechanical force fields, seml-emplrica! force fields, and molecular mechanics force fields. In an 
alternate embodiment, scoring functions that are knowledge-based may be used. In an alternate 
embodiment, scoring functions that use statistical methods may find use in the present invention. 
These methods may be used to assess the match between a sequence and a three-dimensional 
protein structure, and hence may be used to score amino acid substitutions for fidelity to the protein 

15 structure. 

In a preferred embodiment, additional energy terms may be included in the scoring function. For 
example, the above mentioned scoring functions may be modified to include terms including but not 
limited to torsional potentials, entropy potentials, additional solvation models including contact models, 

20 solvent exclusion models, and knowledge-based energies derived from protein sequence and/or 
structure statistics including but not limited to threading potentials, reference energies, pseudo 
energies, and sequence biases derived from sequence alignments (as discussed in the previous 
section). In a preferred embodiment, a scoring function is modified to include models for 
immunogenicity, such as functions derived from data on binding of peptides to MHC (Major 

25 Histocompatability Complex), that may be used to identify potentially immunogenic sequences (see 
USSNs 09/903,378; 10/039,170; 60/222,697 and USSN to be determined, filed January 8, 2003 and 
entitled "NOVEL PROTEIN WITH ALTERED IMMUNOGENICITY"; and PCT 01/21823; and 02/00165, 
all herein expressly incorporated by reference). 

30 In one embodiment, as is known in the art, one or more scoring functions may be optimized or 

"trained" during the computational analysis, and then the analysis re-run using the optimized system. 
Such altered scoring functions may be obtained for example, by training a scoring function using 
experimental data. 

35 In a preferred embodiment, the scoring functions used are one or more of the scoring functions which 
are described in US 6,188,965; 6,269,312; and 6,403,312; USSNs 09/782,004; 09/927,790; 
09/877,695; 10/071,859 and 10/218,102; PCTs 98/07254; 01/40091; and 02/25588, all herein 
expressly incorporated by reference. In an alternate embodiment, energy calculation is carried out 
using one or more of the methods described above in combination. 

40 

In the most preferred embodiment, a scoring function using more than one energy term is used. As 
will be appreciated by those skilled in the art, Ig domain stabilization using only a van der Waals 
potential (Looger & Hellinga, 2001, J. Mol. Biol. 307:429-445) or affinity maturation using only an 
electrostatic potential may be inadequate for accurately evaluating the complex interactions in an 
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5 antibody and between an antibody and its antigen. In the most preferred embodiment, energies may 
be calculated using a force field containing energy terms describing van der Waals, solvation, 
electrostatic, hydrogen bond interactions and combinations thereof. In additional embodiments, 
additional energy terms include but are not limited to entropic terms, torsional energies, and 
knowledge-based energies. 

10 

Combinatorial Optimization 

An important component of computational screening is the identification of one or more sequences 
that have a favorable score or are low in energy. In a preferred embodiment, all possible interaction 
energies are calculated prior to optimization. In an alternatively preferred embodiment, energies may 
1 5 be calculated as needed during optimization. 

The need for a combinatorial optimization algorithm is illustrated by examining the number of 
possibilities that are considered in a typical design calculation. The discrete nature of rotamer sets 
allows a simple calculation of the number of possible rotameric sequences for a given design 

20 problem. A backbone of length n with m possible rotamers per position will have m n possible rotamer 
sequences, a number which grows exponentially with sequence length. For very simple design 
calculations, it is possible to examine each possible sequence in order to identify the optimal 
sequence and/or one or more favorable sequences. However, for a typical design problem, the 
number of possible sequences (up to 10 a0 or more) is sufficiently large that examination of each 

25 possible sequence is intractable. A variety of combinatorial optimization algorithms may then be used 
to identify the optimum sequence and/or one or more favorable sequences. 

Combinatorial optimization algorithms may be divided into two classes: (1) those that are guaranteed 
to return the global minimum energy configuration if they converge, and (2) those that are not 

30 guaranteed to return the global minimum energy configuration, but which will always return a solution. 
Examples of the first class of algorithms include, but are not limited to, Dead-End Elimination (DEE) 
and Branch & Bound (B&B) (including Branch and Terminate) (Gordon & Mayo, 1999, Structure Fold. 
Des. 7:1089-98). Examples of the second class of algorithms include, but are not limited to, Monte 
Carlo (MC), self-consistent mean field (SCMF), Boltzmann sampling (Metropolis et a/., 1953, J. Chem. 

35 Phys. 21 :1 087), simulated annealing (Kirkpatrick ef a/., 1 983, Science, 220:671-680), genetic 
algorithm (GA) and Fast and Accurate Side-Chain Topology and Energy Refinement (FASTER 
(Desmet, ef a/., 2002, Proteins, 48:31-43). A combinatorial optimization algorithm may be used alone 
or in conjunction with another combinatorial optimization algorithm. 

40 In one embodiment of the present invention, the strategy for applying a combinatorial optimization 
algorithm is to find the global minimum energy configuration. In an alternate embodiment, the 
strategy is to find one or more low energy or favorable sequences. In an alternate embodiment, the 
strategy is to find the global minimum energy configuration and then find one or more low energy or 
favorable sequences. For example, as outlined in US 6,269,312 and PCT US98/07254, preferred 
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5 embodiments utilize a Dead End Elimination (DEE) step, and preferably a Monte Carlo step. In other 
embodiments tabu search algorithms are used or combined with DEE and/or Monte Carlo, among 
other search methods (see Modern Heuristic Search Methods, edited by V.J. Rayward-Smith, et a/., 
1996, John Wiley & Sons Ltd., hereby expressly incorporated by reference in its entirety and also 
USSN 10/218,102 and PCT 02/25588). In another preferred embodiment, a genetic algorithm may be 
10 used. See, USSN 09/877,695 and 10/071,859, both herein expressly incorporated by reference. As 
another example, as is more fully described in US 6,188,965; 6,269,312; and 6,403,312; USSNs 
09/782,004; 09/927,790; and 10/218,102; PCTs 98/07254; 01/40091; and 02/25588, which are herein 
expressly incorporated by reference, the global optimum may be reached, and then further 
computational processing may occur, which generates additional optimized sequences. 

15 

In the simplest embodiment, design calculations are not combinatorial. That is, energy calculations 
are used to evaluate the amino acid substitutions individually at single variable positions. However, it 
is a more preferred embodiment in certain situations to combine design calculations and also to 
evaluate amino acid substitutions at more than one variable positions. 

20 

Library Generation 

The output sequence or sequences from computational screening may be used to generate an 
experimental library. By "experimental library" herein is meant a list of one or more protein variants, 
existing either as a list of amino acid sequences or a list of the nucleotides sequences encoding them. 

25 Such a library may then be screened experimentally to single out superior members of antibody 

variants that are optimized for the desired property. As discussed above, computationally screened 
libraries have a number of benefits. Computationally generated libraries are significantly enriched in 
stable, properly folded, and functional sequences relative to randomly generated libraries. Because of 
the overlapping sequence constraints on antibody structure, stability, solubility, function, etc., a large 

30 number of the candidates In an experimental library occupy "wasted" sequence space. For example, 
a large fraction of sequence space encodes unfolded, misfolded, incompletely folded, partially folded, 
or aggregated proteins. In contrast, experimental libraries that are screened computationally are 
composed primarily of productive sequence space. As a result, computational screening increases 
the chances of identifying antibodies that are broadly optimized for stability, solubility, and affinity for 

35 antigen. In effect, computational screening yields an increased hit-rate, thereby decreasing the 

number of variants that must be screened experimentally. The term "experimental library" may refer 
to the set of optimized antibodies in any form. In one embodiment, the library is a list of nucleic acid 
or amino acid sequences, or a list of nucleic acid or amino acid substitutions at variable positions. For 
example, the examples used to illustrate the present invention below provide experimental libraries as 

40 amino acid substitutions at variable positions. In an alternate embodiment, the library is a physical 
library composed of nucleic acids that encode the optimized library sequences. Said nucleic acids 
may be the genes encoding the optimized antibodies, the genes encoding the optimized antibodies 
with any operably linked nucleic acids, or expression vectors encoding the library members together 
with any other operably linked regulatory sequences, selectable markers, fusion constructs, and/or 
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5 other elements. For example, the experimental library may be a set of mammalian expression vectors 
that encode library members, the protein products of which may be subsequently expressed, purified, 
and screened experimentally. As another example, the experimental library may be a display library. 
Such a library could, for example, be composed of a set of expression vectors which encode library 
members operably linked to some fusion partner that enables phage display, ribosome display, yeast 
10 display, bacterial surface display, and the like. Such a library could be used, for example, to screen 
for antibodies against a target antigen, or to affinity mature a particular antibody. In an alternate 
embodiment, the library is a physical library that is comprised of the optimized antibody proteins, 
either in purified or unpurified form. 

1 5 In one embodiment, an experimental library is a list of at least one sequence that are variant 
antibodies optimized for a desired property. For example see, Filikov et a/., 2002, Protein Set. 
1 1:1452-1461 and Luo et a/., 2002, Protein Sci 11:1218-1226. In an alternate embodiment, an 
experimental library may be defined as a combinatorial list, meaning that each a list of amino acid 
substitutions is designed for each variable position, with the implication that each substitution is to be 

20 combined with all other designed substitutions at all other variable positions. In this case, expansion 
of the combination of all possibilities at all variable positions results in a large explicitly defined library. 

Selecting Sequences for the Experimental Library 

As is known in the art, there are a variety of ways that an experimental library may be derived from 
25 the output of computational screening calculations. For example, methods of library generation 
described in US 6,403,312; USSNs 09/782,004; 09/927,790; and 10/218,102; PCTs 01/40091; and 
02/25588, herein expressly incorporated by reference, find use in the present invention. 

• In one embodiment, sequences scoring within a certain range of-the global optimum sequence may 
30 be included in the library. For example, all sequences within 10 kcal/mol of the lowest energy 

sequence could be used as the experimental library. In an alternate embodiment, sequences scoring 
within a certain range of one or more local minima sequences may be used. In a preferred 
embodiment, the library sequences are obtained from a filtered set. Such a list or set may be 
generated by a variety of methods, as is known in the art, for example using an algorithm such as 
35 Monte Carlo, B&B, or SCMF. For example, the top 10 3 or the top 10 5 sequences in the filtered set 
may comprise the experimental library. Alternatively, the total number of sequences defined by the 
combination of all mutations may be used as a cutoff criterion for the experimental library. Preferred 
values for the total number of recombined sequences range from 10 to 10 20 , particularly preferred 
values range from 100 to 10°. Alternatively, a cutoff may be enforced when a predetermined number 
40 of mutations per position is reached. 

Clustering algorithms may be useful for classifying sequences derived by computational screening 
methods into representative groups. For example, methods of clustering and their application 
described in USSN 10/218,102 and PCT 02/25588, herein expressly incorporated by reference, find 
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5 use in the present invention. Representative groups may be defined, for example, by similarity. 

Measures of similarity include, but are not limited to sequence similarity and energetic similarity. Thus 
the output sequences from computational screening may be clustered around local minima, referred 
to herein as clustered sets of sequences. For example, sets of sequences that are close in sequence 
space may be distinguished from other sets. In one embodiment, coverage within one or a subset of 

10 clustered sets may be maximized by including in the experimental library some, most, or all of the 
sequences that make up one or more clustered sets of sequences. For example, the user may wish 
to maximize coverage within the one, two, or three lowest energy clustered sets by including the 
majority of sequences within these sets in the library. In an alternate embodiment, diversity across 
clustered sets of sequences may be sampled by including within an experimental library only a subset 

15 of sequences within each clustered set. For example, all or most of the clustered sets could be 

broadly sampled by including the lowest energy sequence from each clustered set in the experimental 
library. 

In some embodiments, sequences that do not make the cutoff are included in the experimental library, 
20 This may be desirable in some situations, for instance to evaluate the approach to library generation, 
to provide controls or comparisons, or to sample additional sequence space. For example, the WT 
antibody sequence may be included in the library, even if it does not make the cutoff. 

The set of antibody sequences in an experimental library is generally, but not always, significantly 
25 different from the wild type antibody template, although in some cases the library preferably contains 
the wild-type sequence. The range of optimized protein sequences is dependent upon many factors 
including the size of the protein, properties desired, etc. 

Use of Sequence Information to Guide Library Generation 

30 In one embodiment of the present invention, sequence information may be used to guide or filter a 
computationally screened output for generation of an experimental library. As discussed, by 
comparing and contrasting alignments of antibody sequences, the degree of variability at a position 
and the types of amino acids which occur naturally at that position may be observed. Data obtained 
from such analyses are useful in the present invention. The benefits of using sequence information 

35 have been discussed, and those benefits apply equally to use of sequence information to guide library 
generation. The set of amino acids which occur in an antibody sequence alignment may be thought 
of as being pre-screened by evolution to have a higher chance than random at being compatible with 
an antibody's structure, stability, solubility, function, etc. Furthermore, certain alignments may provide 
represent sequences that are less immunogenic than random sequences. The variety of sequence 

40 sources, as well as the methods for generating antibody sequence alignments that have been 

discussed find use in the application of sequence information to guiding library generation. Likewise, 
as discussed above, various criteria may be applied to determine the importance or weight of certain 
residues in an alignment. These methods also find use in the application of sequence information to 
guide library generation. 
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5 

Using sequence information to guide library generation from the results of computational screening 
finds broad use in the present invention. In one embodiment, sequence information is used to filter 
sequences from computational screening output. That is to say, some substitutions are subtracted 
from the computational output to generate the experimental library. For example, to optimize antibody 

10 solubility by replacing exposed nonpolar surface residues, the resulting output of a computational 
screening calculation or calculations may be filtered so that the experimental library includes only 
those amino acids, or a subset of those amino acids which meet some criteria, that are observed at 
that position in an alignment of antibody sequences. In an alternate embodiment, sequence 
information is used to add sequences to the computational screening output. That is to say, 

15 sequence information is used to guide the choice of additional amino acids that are added to the 

computational output to generate the experimental library. For example, to optimize antibody stability 
by mutating domain interface residues, the output set of amino acids for a given position from a 
computational screening calculation may be augmented to include one or more amino acids that are 
observed at that position in an alignment of antibody sequences. In an alternate embodiment, based 

20 on sequence alignment information, one or more ammo acids may be added to or subtracted from the 
computational screening sequence output in order to maximize coverage or diversity. For example, 
additional amino acids with properties similar to those that are found in a sequence alignment may be 
added to the experimental library. For example, if a position involved in antigen binding is observed to 
have uncharged polar amino acids in an antibody sequence alignment, the user may choose to 

25 include additional uncharged polar amino acids to the experimental library at that position. 

Generation of Secondary Libraries 

In one embodiment of the present invention, libraries may be processed further to generate 
subsequent libraries. In this way, the output from a computational screening calculation or 

30 calculations may be thought of as a primary library. This primary library may be combined with other 
primary libraries from other calculations or other experimental libraries, processed using subsequent 
calculations, sequence information, or other analyses, or processed experimentally to generate a 
subsequent library, herein referred to as a secondary library, which could become an experimental 
library. As will be appreciated from this description, the use of sequence information to guide or filter 

35 libraries, discussed above, is itself one method of generating secondary libraries from primary 

libraries. Generation of secondary libraries gives the user greater control of the parameters within an 
experimental library. This enables more efficient experimental screening, and may allow feedback 
from experimental results to be interpreted more easily, providing a more efficient 
design/experimentation cycle. 

40 

There are a wide variety of methods to generate secondary libraries from primary libraries. For 
example, USSN 10/218,102 and PCT 02/25588, herein expressly incorporated by reference, 
describes methods for secondary library generation that find use in the present invention. Typically 
some selection step occurs in which a primary library is processed in some way. For example, in one 
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5 embodiment a selection step occurs where some set of primary sequences are chosen to form the 
secondary library. In an alternate embodiment, a selection step is a computational step, again 
generally including a selection step, wherein some subset of the primary library is chosen and then 
subjected to further computational analysis, Including both further computational screening as well as 
techniques such as "in sfllco* shuffling (recombination). See, for example US 5,830,721 ; 5,81 1 ,238; 

10 5,605,793; 5,837,458, PCT US/19256, Rachitt-Enchira (.enchira.com/gene_shuffling.htm); error-prone 
PCR, for example using modified nucleotides; known mutagenesis techniques including the use of 
multi-cassettes; DNA shuffling (Crameri et a/., 1998, Nature 391:288-291); heterogeneous DNA 
samples (US 5,939,250); ITCHY (Ostermeier et at., 1999, Nat BiotechnoL 17:1205-1209); StEP 
(Zhao e/a/., 1998, Nat. BiotechnoL 16:258-261), GSSM (US 6,171,820 and US 5,965,408); in vivo 

15 homologous recombination, ligase assisted gene assembly, end-complementary PCR, profusion 
(Roberts & Szostak, 1997, Prvc. Natl Acad. Set. USA 94:12297-12302); yeast/bacteria surface 
display (Lu et a/., 1995, Biotechnology 13:366-372); Seed & Aruffo, 1987, Prvc. NatL Acad. Set. USA 
84(10):3365-3369; Boder & Wittrup, 1997, Nat. BiotechnoL 15:553-557). all hereby Incorporated by 
reference. In an alternate embodiment, a selection step occurs that is an experimental step, for 

20 example any of the experimental library screening steps below, wherein some subset of the primary 
library is chosen and then recombined experimentally, for example using one of the directed evolution 
methods discussed below, to form a secondary library. In a preferred embodiment, the primary 
library is generated and processed as outlined in US 6,403,312, which is herein expressly 
incorporated by reference. 

25 

Generation of secondary and subsequent libraries finds broad use in the present invention. In one 
embodiment, different primary libraries may be combined to generate a secondary or subsequent 
library. In another embodiment, secondary libraries may be generated by sampling sequence 
diversity at highly mutatable or highly conserved positions. The primary library may be analyzed to 

30 determine which amino acid positions in the template protein have high mutational frequency, and 
which positions have low mutational frequency. For example, positions in an antibody that show a 
great deal of mutational diversity in computational screening may be fixed in a subsequent round of 
design calculations. A filtered set of the same size as the first would now show diversity at positions 
that were largely conserved in the first library. Alternatively, the secondary library may be generated 

35 by varying the amino acids at the positions that have high numbers of mutations, while keeping 
constant the positions that do not have mutations above a certain frequency. 

This discussion is not meant to constrain generation of libraries subsequent to primary libraries to 
secondary libraries. As will be appreciated, primary and secondary libraries may be processed further 
40 to generate tertiary libraries, quaternary libraries, and so on. In this way, library generation is an 

iterative process. For example, tertiary libraries may be constructed using a variety of additional steps 
applied to one or more secondary libraries; for example, further computational processing may occur, 
secondary libraries may be recombined, or subsets of different secondary libraries may be combined. 
In a preferred embodiment, a tertiary library may be generated by combining secondary libraries. For 
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5 example, primary and/or secondary libraries that analyzed different parts of a protein may be 

combined to generate a tertiary library that treats the combined parts of the protein. In an alternate 
embodiment, the variants from a primary library may be combined with the variants from a second 
library to provide a combined tertiary library at lower computational cost than creating a very long 
filtered set. These combinations may be used, for example, to analyze large proteins, especially large 

10 multi-domain proteins. Thus the above description of secondary library generation applies to 

generating any library subsequent to a primary library, the end result being a final library that may 
screened experimentally to obtain optimized antibodies. These examples are not meant to constrain 
generation of secondary libraries to any particular application or theory of operation for the present 
invention. Rather, these examples are meant to illustrate that generation of secondary libraries, and 

1 5 subsequent libraries such as tertiary libraries and so on, is broadly useful in computational screening 
methodology for experimental library generation. 

Experimental Library Screening 

Once an experimental library is designed using any of the methods outlined herein or combinations 
20 thereof, the physical library may be constructed using a variety of techniques. The library may then 
be screened to obtain antibodies optimized for greater stability, solubility, and/or enhanced affinity for 
antigen. Accordingly, the present invention provides a variety of methods for constructing and 
screening experimental libraries. These methods are not meant to constrain the present invention to 
any particular application or theory of operation. Rather, the provided examples are meant to 
25 illustrate generally that computationally screened libraries may be screened experimentally to obtain 
antibodies with optimized physico-chemical properties. General methods for antibody molecular 
biology, expression, purification, and screening are described in Antibody Engineering . 2001, edited 
by Duebel & Kontermann, Springer-Verlag, Heidelberg; Hayhurst & Georgiou, 2001, Cum Opin. 
Chem. Biol. 5:683-689; Maynard & Georgiou, 2000, Annu. Rev. Biomed. Eng. 2:339-76; all of which 
30 are herein expressly incorporated by reference. 

Molecular Biology and Library Generation 

In one embodiment of the present invention, the experimental library sequences are used to create 
nucleic acids such as DNA which encode the antibody member sequences and which may then be 

35 cloned into host cells, expressed and assayed, if desired. Thus, nucleic acids, and particularly DNA, 
may be made which encode each member protein sequence. These practices are carried out using 
well-known procedures. For example, a variety of methods that may find use in the present invention 
are described In Molecular Cloning- A Laboratory Manual . 3 rt Ed. (Maniatis, Cold Spring Harbor 
Laboratory Press, New York, 2001), and Current Protocols in Molecular Biology (Wiley & Sons, 

40 mrw2.interscience.wiley.com/cponline/), both of which are herein expressly incorporated by reference. 

As will be appreciated by those in the art, the generation of exact sequences for a library comprising a 
large number of sequences is potentially expensive and time consuming. Accordingly, there are a 
variety of techniques that may be used to efficiently generate experimental libraries of the present 
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5 invention. Such methods that may find use in the present invention are described or referenced in US 
6,403,312; USSN 09/782,004; 09/927,790 and 10/218,102; and PCTs 01/40091 and 02/25588, all 
hereby incorporated by reference. Such methods include but are not limited to gene assembly 
methods, PCR-based method and methods which use variations of PCR, ligase chain reaction-based 
methods, pooled oligo methods such as those used in synthetic shuffling, error-prone amplification 
10 methods and methods which use oligos with random mutations, classical site-directed mutagenesis 
methods, cassette mutagenesis, and other amplification and gene synthesis methods. As is known in 
the art, there are a variety of commercially available kits and methods for gene assembly, 
mutagenesis, vector subcloning, and the like, and such commerciai products find use in the present 
invention for generating nucleic acids that encode members of an experimental library. 

15 

Protein Expression 
Expression Systems 

The library antibody proteins of the present invention may be produced by culturing a host ceil 
transformed with nucleic acid, preferably an expression vector, containing nucleic acid encoding an 
20 library protein, under the appropriate conditions to induce or cause expression of the library protein. 
The conditions appropriate for library protein expression will vary with the choice of the expression 
vector and the host cell, and will be easily ascertained by one skilled in the art through routine 
experimentation. 

25 A wide variety of appropriate host cells may be used, including but not limited to mammalian cells, 
bacteria, insect cells, and yeast. For example, a variety of cell lines that may find use in the present 
invention are described in the ATCC cell line catalog (atcc.org), herein expressly incorporated by 
reference. . 

30 In a preferred embodiment, the library proteins are expressed in mammalian expression systems, 
including systems in which the expression constructs are introduced into the mammalian cells using 
virus such as retrovirus or adenovirus. Any mammalian cells may be used, with mouse, rat, primate 
and human cells being particularly preferred. Suitable cells also include known research cells, 
including but not limited to Jurkat T cells, NIH3T3 cells, CHO, COS, etc. In an alternately preferred 

35 embodiment, library proteins are expressed in bacterial systems. Bacterial expression systems are 
well known in the art, and include Escherichia coli(E. co//), Bacilius subtilis. Streptococcus cremohs, 
and Streptococcus lividans. In an alternate embodiment, library proteins are produced in insect cells. 
In an alternate embodiment, library proteins are produced in yeast cells. In an alternate embodiment 
library proteins are expressed in vitro using cell free translation systems. In vitro translation systems 

40 derived from both prokaryotic (e.g. E. coli) and eukaryotic (e.g. wheat germ, rabbit reticulocytes) cells 
are available and may be chosen based on the expression levels and functional properties of the 
protein of interest. For example, as appreciated by those skilled in the art, in vitro translation is 
required for some display technologies, for example ribosome display. In addition, the library proteins 
may be produced by chemical synthesis methods. 
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5 

Expression Vectors 

The nucleic acids that encode the antibody library members may be incorporated into an expression 
vector in order to express the protein. A variety of expression vectors may be utilized to express the 
library proteins. Expression vectors may comprise self-replicating extra-chromosomal vectors or 
10 vectors which integrate into a host genome. Expression vectors are constructed to be compatible with 
the host cell type. Thus expression vectors which find use in the present invention include but are not 
limited to those which enable protein expression in mammalian cells, bacteria, insect cells, and yeast. 
As is known in the art, a variety of expression vectors are available, commercially or otherwise, that 
may find use in the present invention for expressing antibody library proteins. 

15 

Expression vectors typically comprise a library member operably linked with control or regulatory 
sequences, selectable markers, any fusion partners, and/or additional elements. By "operably linked" 
herein is meant that the nucleic acid is placed into a functional relationship with another nucleic acid 
sequence. Generally, these expression vectors include transcriptional and transiationa! regulatory 

20 nucleic acid operably linked to the nucleic acid encoding the library antibody, and are typically 
appropriate to the host cell used to express the library protein. In general, the transcriptional and 
translational regulatory sequences may include, but are not limited to, promoter sequences, ribosomal 
binding sites, transcriptional start and stop sequences, translational start and stop sequences, and 
enhancer or activator sequences. As is also known in the art, expression vectors typically contain a 

25 selection gene or marker to allow the selection of transformed host cells containing the expression 
vector. Selection genes are well known in the art and will vary with the host cell used. 

Fusion Partners 

Antibody library members may be operably linked to a fusion partner to enable targeting of the 
30 expressed protein, purification, screening, display, and the like. Fusion partners may be linked to the 
library member sequence via a linker sequences. The linker sequence will generally comprise a small 
number of amino acids, typically less than ten, although longer linkers may also be used. Typically, 
linker sequences are selected to be flexible and resistant to degradation. As will be appreciated by 
those skilled in the art, any of a wide variety of sequences may be used as linkers. For example, a 
35 common linker sequence comprises the amino acid sequence GGGGS. 

A fusion partner may be a targeting or signal sequence that directs library antibody protein and any 
associated fusion partners to a desired cellular location or to the extracellular media. As is known in 
the art, certain signaling sequences may target a protein to be either secreted into the growth media, 
40 or into the periplasmic space, located between the inner and outer membrane of the cell. 

A fusion partner may also be a sequence that encodes a peptide or protein that enables purification 
and/or screening. Such fusion partners include but are not limited to polyhistidine tags (for example 
Hise and His 10 or other tags for use with Immobilized Metal Affinity Chromatography (IMAC) systems 
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5 (e.g. Ni* 2 affinity columns)), GST fusions, MBP fusions, Strep-tag, the BSP biotinylation target 
sequence of the bacterial enzyme BirA, and epitope tags which are targeted by antibodies (for 
example to c-myc tags, flag tags, and the like). As will be appreciated by those skilled in the art, such 
tags may be useful for purification, for screening, or both. For example, an antibody fragment may be 
purified using a His-tag by immobilizing it to a Ni* 2 affinity column, and then after purification the same 
10 His-tag may be used to immobilize the antibody to a Nf 2 coated plate to perform an ELISA or other 
binding assay (see "Screening of Library Members" section below). 



A fusion partner may enable the use of a selection method to screen antibody library members (see 
"Screening based on selection methods" below). Fusion partners which enable a variety of selection 

15 methods are well-known in the art, and all of these find use in the present invention. For example, by 
fusing the members of an antibody library to the gene III protein, phage display can be used (Kay et 
a/., 1996, Phage display of peptides and proteins: a laboratory manual, Academic Press, San Diego, 
CA); Lowman e/a/., 1991, Biochemistry 30:1 0832-1 0838; Smith, 1985, Science 228:1315-1317). 
Fusion partners may enable antibody library members to be labeled. Alternatively, a fusion partner 

20 may bind to a specific sequence on the expression vector, enabling the fusion partner and associated 
antibody library member to be linked covalently or noncovaiently with the nucleic acid that encodes 
them. For example, USSNs 09/642,574; 10/080,376; 09/792,630; 10/023,208; 09/792,626; 
10/082,671; 09/953,351; 10/097,100; and 60/366,658; PCTs 00/22906; 01/49058; 02/04852; 
02/04853; 02/08023; 01/28702; and 02/07466; all herein expressly incorporated by reference, 

25 describe such a fusion partner and technique that may find use in the present invention. 



Transformation and Transfection Methods 

The methods of introducing exogenous nucleic acid into host cells is well known in the art, and will 
. vary with the host cell used. Techniques include but are not limited to dextran-mediated transfection, 
30 calcium phosphate precipitation, calcium chloride treatment, polybrene mediated transfection, 

protoplast fusion, electroporation, viral or phage infection, encapsulation of the polynucleotide(s) in 
liposomes, and direct microinjection of the DNA into nuclei. In the case of mammalian cells, 
transfection may be either transient or stable. 

35 Protein Purification 

In a preferred embodiment, antibody library members are purified or isolated after expression. 
Antibodies may be isolated or purified in a variety of ways known to those skilled in the art. Standard 
purification methods include chromatographic techniques, including ion exchange, hydrophobic 
interaction, affinity, sizing or gei filtration, and reversed-phase, carried out at atmospheric pressure or 

40 at high pressure using systems such as FPLC and HPLC. Purification methods also include 
electrophoretic, immunological, precipitation, dialysis, and chromatofocusing techniques. 
Ultrafiltration and diafiltration techniques, in conjunction with protein concentration, are also useful. 
As is well known in the art, a variety of natural proteins bind antibodies, and these proteins can find 
use in the present invention for purification of antibody library members. For example, the bacterial 
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5 proteins A and G bind to the Fc region, and the bacterial protein L binds to the Fab region. 
Purification can often be enabled by a particular fusion partner. For example, antibody library 
members may be purified using glutathione resin if a GST fusion is employed, Ni* 2 affinity 
chromatography if a His tag is employed, or immobilized anti-flag antibody If a flag tag is used. For 
general guidance in suitable purification techniques, see Protein Purification: Principles and Practice. 
10 3 rd Ed., Scopes, Springer-Verlag, NY, 1994, hereby expressly incorporated by reference. 

The degree of purification necessary will vary depending on the screen or use of the antibody library 
members. In some instances no purification is necessary. For example in one embodiment, if library 
antibodies are secreted, screening may take place directly from the media. As is well known in the 
1 5 art, some methods of selection do not involve purification of library proteins. Thus, for example, if the 
optimized antibody sequences are made into a phage display library, antibody purification may not be 
performed. 

Screening of Library Members 

20 Library members may be screened using a variety of methods, including but not limited to those that 
use in vitro assays, in vivo and cell-based assays, and selection technologies. 'Automation and high- 
throughput screening technologies may be utilized In the screening procedures. Screening may 
employ the use of a fusion partner or label. The use of fusion partners has been discussed above. 
By "labeled" herein is meant that the antibodies of the invention have one or more elements, isotopes, 

25 or chemical compounds attached to enable the detection in a screen. In general, labels fall into three 
classes: a) immune labels, which may be an epitope incorporated as a fusion partner that is 
recognized by an antibody, b) isotopic labels, which may be radioactive or heavy isotopes, and c) 
small molecule labels, which may Include fluorescent and colorimetric dyes, or molecules such as 
biotin which enable-other labeling methods. Labels may be incorporated into the compound at any 

30 position and may be incorporated in vitro or in vivo during antibody expression. 

In vitro Assays 

In a preferred embodiment, the functional and/or biophysical properties of antibody library members 
are screened In an in vitro assay. In vitro assays may allow a broad dynamic range for screening 
35 antibody properties of interest. Properties of library members that may be screened include but are 
not limited to stability, solubility, and affinity for antigen, antibody receptors, or other proteins which 
are known to bind the antibody being optimized. Multiple properties may be screened simultaneously 
or individually. Proteins may be purified or unpurified, depending on the requirements of the assay. 

40 In one embodiment, the screen is a qualitative or quantitative binding assay for binding of antibody 
library members to a protein or nonprotein molecule that is known to bind the antibody. In a preferred 
embodiment, the screen is a binding assay for measuring the binding of antibody library members to 
the antibody's antigen. In an alternately preferred embodiment, the screen is an assay for antibody 
binding to an antibody receptor or some other protein that is known to bind antibodies. For example, 
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5 a number of proteins are known to bind the Fc region (Ravetch & Bolland, 2001 , Ann. Rev. Immunol. 
19:275-90; Raghavan & Bjorkman, 1996, Annu. Rev. Cell Dev. Biol. 12:181-220), including the family 
of FcyRs, the neonatal receptor FcRn, the complement protein C1q, and the bacterial proteins A and 
G. Binding assays can be carried out using a variety of methods known in the art. These methods 
include but are not limited to FRET (Fluorescence Resonance Energy Transfer) and BRET 

10 (Bioluminescence Resonance Energy Transfer) -based assays, AlphaScreen (Amplified Luminescent 
Proximity Homogeneous Assay), Scintillation Proximity Assay, ELISA (Enzyme-Linked 
Immunosorbent Assay), SPR (Surface Piasmon Resonance) or BIACORE, isothermal titration 
calorimetry, differential scanning calorimetry, gel electrophoresis, and chromatography including gel 
filtration. These and other methods may take advantage of some fusion partner or label of the 

15 antibody library member. Assays may employ a variety of detection methods including but not limited 
to chromogenic, fluorescent, luminescent, or isotopic labels. 

The biophysical properties of antibodies, for example stability and solubility, may be screened using a 
variety of methods known in the art. Protein stability may be determined by measuring the 

20 thermodynamic equilibrium between folded and unfolded states. For example, antibody library 

members of the present invention may be unfolded using chemical denaturant, heat, or pH, and this 
transition may be monitored using methods including but not limited to circular dichroism 
spectroscopy, fluorescence spectroscopy, absorbance spectroscopy, NMR spectroscopy, calorimetry, 
and proteolysis. As will be appreciated by those skilled In the art, the kinetic parameters of the folding 

25 and unfolding transitions may also be monitored using these and other techniques. The solubility and 
overall structural integrity of an antibody may be quantitatively or qualitatively determined using a wide 
range of methods that are known in the art. Methods which may find use in the present invention for 
characterizing the biophysical properties of antibody library members include gel electrophoresis, 
chromatography such as size exclusion chromatography and reversed-phase high performance liquid 

30 chromatography, mass spectrometry, ultraviolet absorbance spectroscopy, fluorescence 

spectroscopy, circular dichroism spectroscopy, isothermal titration calorimetry. differential scanning 
calorimetry, analytical ultra-centrifugation, dynamic light scattering, proteolysis, and cross-linking, 
turbidity measurement, filter retardation assays, immunological assays, fluorescent dye binding 
assays, protein-staining assays, microscopy, and detection of aggregates via ELISA. Structural 

35 analysis employing X-ray crystallographic techniques and NMR spectroscopy may also find use. In 
one embodiment, antibody stability and/or solubility may be measured by determining the amount of 
antibody in solution after some defined period of time. In this assay, the antibody may or may not be 
exposed to some extreme condition, for example elevated temperature, low pH, or the presence of 
denaturant. Because antibody function typically requires a stable, soluble, and/or well- 

40 folded/structured antibody, the functional (i.e. binding) assays described above also provide a way to 
perform such an assay. For example, a solution comprising an antibody variant could be assayed for 
its ability to bind antigen, then exposed to elevated temperature for one or more defined periods of 
time, then assayed for antigen binding again. Because unfolded and aggregated antibody is not 
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5 expected to be capable of binding antigen, the amount of antibody activity remaining provides a 
measure of the antibody variant's stability and solubility. 

In Vivo or Cell-based Assays 

In a preferred embodiment, the library is screened using one or more cell-based or in vivo-based 

10 assays. Cell types for such assays may be prokaryotic or eukaryotic. For such assays, antibody 

library members, purified or unpurified, are typically added exogenously such that cells are exposed to 
individual variants or pools of variants belonging to a library. These assays are typically, but not 
always, based on the function of the antibody, that is the ability of the antibody to bind an antigen 
and/or some protein which naturally binds the antibody, for example an Fc receptor. Such assays 

15 often involve monitoring the response of cells to antibody, for example cell survival, cell death, change 
in cellular morphology, or transcriptional activation such as cellular expression of a natural gene or 
reporter gene. For example, anti-cancer antibodies may cause apoptosis of certain cell lines 
expressing the antibody's target antigen, or they may mediate attack on target cells by immune cells 
which have been added to the assay. Methods for monitoring cell death or viability are known in the 

20 art, and include the use of dyes, immunochemical, cytochemical, or radioactive reagents. For 

example, caspase staining assays may enable apoptosis to be measured, and uptake of radioactive 
substrates or the dye alamar blue may enable cell growth or activation to be monitored. 
Transcriptional activation may also serve as a method for assaying antibody function in cell-based 
assays. In this case, response may be monitored by assaying for natural genes or proteins which 

25 may be upregulated, for example the release of certain interleukins may be measured, or alternatively 
readout may be via a reporter construct. Cell-based assays may also involve the measure of 
morphological changes of cells as a response to the presence of an antibody library variant. 

Alternatively, cell-based screens are performed directly using cells that have been transformed or 
30 transfected with nucleic acids encoding antibody library members. That is, antibody library variants 
are not added exogenously to the cells. For example, in one embodiment, the cell-based screen 
utilizes cell surface display. A fusion partner can be employed that enables display of antibodies on 
the surface of cells (Witrrup, 2001, Curr. Opin. Biotechnol., 12:395-399). Cell surface display 
methods which may find use in the present invention include but are not limited to display on bacteria 
35 (Georgiou et a/., 1997, Nat Biotechnol. 15:29-34.; Georgiou ef a/., 1993, Trends Biotechnol. 11:6-10; 
Lee et a/., 2000, Nat Biotechnol. 18:645-648; Jun et a/, 1998, Nat Biotechnol. 16:576-80.), yeast 
(Boder & Wittrup, 2000, Methods Enzymol. 328:430-44; Boder & Wittrup, 1997, Nat Biotechnol. 
15:553-557), and mammalian cells (Whitehom et a/, 1995, Bio/technology 13:1215-1219). In an 
alternate embodiment, antibodies are not displayed on the surface of cells, but rather are screened 
40 intracellularly or in some other cellular compartment. For example, periplasmic expression and 
cytometric screening (Chen ef a/, 2001, Nat Biotechnol., 19: 537-542), the protein fragment 
complementation assay (Johnsson & Varshavsky, 1994, Proc. Natl. Acad. Sci. USA, 91: 10340- 
10344.; Pelletier et a/., 1998, Proc. Natl. Acad. Sci.USA 95:12141-12146), and the yeast two hybrid 
screen (Fields & Song, 1989, Nature 340:245-246) may find use in the present invention. 
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5 Alternatively, if the antibody imparts some selectable growth advantage to a cell, this property may be 
used to screen or select for antibody variants. 

The biological properties of one or more antibody library members, including clinical efficacy, 
pharmacokinetics, and toxicity, may also be characterized in cell, tissue, and whole organism 
10 experiments. 

Screening Based on Selection Methods 

As is known in the art, a subset of screening methods are those that select for favorable members of 
a library. Said methods are herein referred to as "selection methods", and these methods find use in 

15 the present invention for screening antibody libraries. When antibody libraries are screened using a 
selection method, only those members of a library which are favorable, that is which meet some 
selection criteria, are propagated, isolated, and/or observed. As will be appreciated, because only the 
most fit antibody variants are observed, such methods enable the screening of libraries which are 
larger than those screenable by methods which assay the fitness of library members individually. 

20 Selection is enabled by any method, technique, or fusion partner which links, covalently or 

noncovalently, the phenotype of an antibody variant with its genotype, that is the function of an 
antibody with the nucleic acid that encodes it. For example the use of phage display as a selection 
method is enabled by the fusion of library members to the gene III protein. In this way, selection or 
isolation of antibody proteins which meet some criteria, for example binding affinity for antigen, also 

25 selects for or isolates the nucleic acid which encodes it Once isolated, the gene or genes encoding 
library antibody variants may then be amplified. This process of isolation and amplification, referred to 
as panning, may be repeated, allowing favorable antibody variants in the library to be enriched. 
Nucleic acid sequencing of the attached nucleic acid ultimately allows for gene identification. 

30 A variety of selection methods are known in the art which may find use in the present invention for 
screening antibody libraries. These include but are not limited to phage display (Phage display of 
peptides and proteins: a laboratory manual . Kay et a/., 1996, Academic Press, San Diego, CA; 
Lowman et a/., 1991, Biochemistry 30:10832-10838; Smith, 1985, Science 228:1315-1317) and its 
derivatives such as selective phage infection (Malmborg et a/., 1997, J. Mol. Biol. 273:544-551), 

35 selectively infective phage (Krebber et al., 1997, J. Mol. Biol. 268:619-630), and delayed infectivity 
panning (Benhar et a/., 2000, J. Mol. Biol. 301 :893-904), cell surface display (Witrrup, 2001 , Cum 
Opin. Biotechnol., 12:395-399) such as display on bacteria (Georgiou et a/., 1997, Nat. Biotechnol. 
15:29-34.; Georgiou ef a/., 1993, Trends Biotechnol. 11:6-10; Lee et a/., 2000, Nat. Biotechnol. 
1 8:645-648; Jun et a/, 1998, Nat. Biotechnol. 16:576-80), yeast (Boder & Wittrup, 2000, Methods 

40 Enzymol. 328:430-44; Boder & Wittrup, 1997, Nat Biotechnol. 15:553-557), and mammalian cells 
(Whitehorn ef a/, 1995, Bio/technology 13:1215-1219), as well as in vitro display technologies 
(Amstutz et a/., 2001 , Cum Opin. Biotechnol. 12:400-405) such as polysome display (Mattheakis ef 
a/., 1994, Proc Natl. Acad. Sci. USA 91:9022-9026), ribosome display (Hanes et al, 1997, Proc. Natl. 
Acad. Sci. USA 94:4937-4942), mRNA display (Roberts & Szostak, 1997, Proc. Natl. Acad. Sci. USA 
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5 94:12297-12302; Nemoto ef a/, 1997, FEBS Lett. 414:405-408), and ribosome-inactivation display 
system (Zhou ef a/., 2002, J. Am. Chem. Soc. 124, 538-543) 

Other selection methods which may find use In the present Invention Include methods that do not rely 
on display, such as in vivo methods including but not limited to periplasmic expression and cytometric 

10 screening (Chen ef a/, 2001, Nat Biotechnol., 19: 537-542), the protein fragment complementation 
assay (Johnsson & Varshavsky, 1994, Proc. Natl. Acad. Sci. USA, 91: 10340-10344; Pelletier ef a/., 
1998, Proc. Natl. Acad. Sci.USA 95:12141-12146), and the yeast two hybrid screen (Fields & Song, 
1989, Nature 340:245-246) used in selection mode (Visintin ef a/., 1999, Proc. Natl. Acad. Sci. USA 
96: 1 1723-1 1728). In an alternate embodiment, selection is enabled by a fusion partner which binds 

15 to a specific sequence on the expression vector, thus linking covalently or noncovalentfy the fusion 
partner and associated antibody library member with the nucleic acid that encodes them. In an 
alternative embodiment, in vivo selection can occur if expression of the library antibody imparts some 
growth, reproduction, or survival advantage to the cell. 

20 As is known in the art, a subset of selection methods referred to as "directed evolution methods" are 
those that include the mating or breading of favorable sequences during selection, sometimes with the 
incorporation of new mutations. As will be appreciated by those skilled in the art, directed evolution 
methods can facilitate identification of the most favorable sequences in a library, and can increase the 
diversity of sequences that are screened. A variety of directed evolution methods are known in the art 

25 that may find use in the present invention for screening antibody libraries, including but not limited to 
DNA shuffling (WO 00/42561 A3; WO 01/70947 A3), exon shuffling (US 6365 377 B1; Kolkman & 
Stemmer, 2001, Nat. Biotechnol. 19:423-428), family shuffling (Crameri ef a/., 1998, Nature 391:288- 
291; US 6376246 B1), RACHITT™ (Coco ef a/., 2001, Nat Biotechnol. 19:354- 359; WO 02/06469 
A2), STEP^nd random priming of in vitro recombination (Zhao ef a/., 1998, Afaf. Biotechnol. 16:258- 

30 261 ; Shao ef a/., 1998, Nucleic Acids Res. 26:681-683), exonuclease mediated gene assembly (US 
6352842 B1; US 6361974 B1), Gene Site Saturation Mutagenesis™ (US 6358709 B1), Gene 
Reassembly™ (US 6358709B1), SCRATCHY (Lutz ef a/., 2001, Proc. Natl. Acad. Sci. USA 98:11248- 
11253), DNA fragmentation methods (Kikuchi ef a/., Gene 236:159-167), and single-stranded DNA 
shuffling (Kikuchi etai, 2000, Gene 243:133-137), all of which are herein expressly incorporated by 

35 reference. 

Design Strategies 

A variety of computational screening design strategies are provided for optimization of the physico- 
chemical properties of antibodies, including stability, solubility, and antigen binding affinity. These 
40 strategies can be used individually or in combination. 

Stability Optimization 

There is frequently a need to enhance the stability of an antibody. Lower stability of a full-length 
antibody or an antibody fragment may result in greater amount of nonnative and thus nonfunctional 
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5 species, increased susceptibility to degradation, and greater tendency for aggregation. Increased 
degradation and aggregation may result in lower in vivo half-life of the molecule if the antibody is a 
therapeutic, further decreasing activity. 

In one object of the present invention, computational screening methodology is used to enhance the 
10 stability of an antibody. A number of design strategies are disclosed for antibody stabilization, 
including strategies which employ experimental Information and/or sequence information to guide 
choice of variable positions, choice of amino acids considered at those positions, and/or generation of 
one or more experimental libraries from computational output. The disclosed design strategies are 
not meant to constrain the present invention to any particular application or theory of operation. 
15 Rather, the present invention relates as novel not only these provided individual strategies, but the 
general use of computational screening to enhance the stability of antibodies. 

The stability of an antibody is comprised of: a) the stabilities of each Individual Ig domain which make 
up the antibody, and b) the stabilities or affinities of interdomain interactions if the antibody is 
20 composed of more than one Ig domain. Thus two main strategies for utilizing computational 

screening methodology to stabilize antibodies are to enhance the stability of individual Ig domains, 
and enhance interface stability between individual ig domains. 

Domain Stability 

25 The stability of an antibody is determined in part by the individual stabilities of each of the Ig domains 
that comprise it. In one embodiment, computational screening is used to stabilize an antibody by 
enhancing the stability of one or more individual Ig domains. In this embodiment, more favorable 
interactions are designed within one or more individual Ig domains, thereby increasing the global 
- stability of the antibody as a whole. For an antibody which is made up of more than one Ig domain, 

30 each individual Ig domain may be engineered for greater stability. Thus for example, for antibodies 
derived from human, mouse, rat, or rabbit antibodies, the stability may be improved by stabilizing one 
or more of domains V H , V L , Cy1 , C Ll Cy2, and Cy3. 

In one embodiment, the interior of an Ig domain or Ig domains are redesigned to be more stable. For 
35 example, as will be appreciated by those skilled in the art, the van der Waals packing interactions 
between nonpolar residues in the core play an important role protein stability. Mutations may be 
designed that result in more favorable interactions between interior residues. In another embodiment, 
non-interior residues, that is boundary or surface positions an Ig domain or domains are designed to 
be more stable. For example, greater stability may be gained when amino acid side chains which 
40 have the capacity to donate a hydrogen bond are interacting with a molecule which is capable of 
accepting a hydrogen bond, whether this molecule be another side chain, the protein backbone, or 
solvent. Interior and non-interior residues may be identified by objective methods such as degree of 
solvent exposure, as described above, subjective methods such as visual inspection by one skilled in 
the art of protein structural biology, or other methods. As described above, variable positions and 
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amino acids considered at those positions may be chosen using any variety of approaches, including 
but not limited to approaches based on solvent exposure, approaches which are hypothesis-driven, 
approaches which utilize experimental information, approaches which utilize sequence information, or 
any combination of these and other approaches. 

A number of examples are provided below which describe the use of computational screening 
methods to stabilize the Ig domains of an antibody. These examples are not meant to constrain the 
present invention to any particular application or theory of operation. Rather, the present invention 
relates as novel not only these provided individual examples, but the general use of computational 
screening methodology to enhance the stability of an Ig domain or Ig domains in order to optimize an 
antibody for greater stability. 

Interface Stability 

The stability of multi-lg domain antibodies, that is to say full-length antibodies and antibody fragments 
which are composed of more than one Ig domain, are determined in part by the affinities of the 
interactions between domains (Worn & Pluckthun. 2001. J. Moi. Bid. 305:989-1010). Two interacting 
Ig domains exist in equilibrium between bound and unbound states. In the unbound state, Ig domains 
have a greater tendency to unfold and aggregate than when they are in the bound state. Thus by 
designing more favorable interactions between residues that mediate the interdomain interaction, the 
bound state may be stabilized, thereby stabilizing the antibody as a whole. In one embodiment of the 
present invention, computational screening Is used to engineer mutations that result in more favorable 
interactions between individual Ig domains. As shown in Figure 1, for human antibodies there are five 
interdomain interfaces that may be optimized using computational screening methodology: V H -V L , 
Cy1 -C L , V H -Cy1 . V U -C L . and Cy3-Cy3. The stability of a Fab is dependent on the interactions at only a 
subset of these interfaces: Vh-V l . Cy1-C L . V H -Cy1. and V L -C L . 

Greater interdomain stability may be obtained by engineering more energetically favorable 
interactions between residues that mediate the interdomain interface. Such designed interactions 
could involve more favorable packing interactions, hydrogen bond interactions, electrostatic 
interactions, hydrophobic interactions, and the like. Interface residues may be identified by objective 
methods such as degree of solvent exposure, as described above, subjective methods such as visual 
inspection by one skilled in the art of protein structural biology, or other methods. As described 
above, variable positions and amino acids considered at those positions may be chosen using any 
variety of approaches, including but not limited to approaches based on solvent exposure, 
approaches which are hypothesis-driven, approaches which utilize experimental information, 
approaches which utilize sequence information, or any combination of these and other approaches. 

In one embodiment, the interface is designed to have more favorable nonpolar interactions, for 
example by engineering the interface with more nonpolar volume than that in the antibody template, 
by designing nonpolar residues which pack better together than that in the antibody template, and the 
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5 like. As will be appreciated by those skilled in the art, this may be thought of as the interface version 
of a redesigned hydrophobic core. Here, however, variable positions are those that make up the 
interface between Ig domains instead of the core of an Ig domain. In an alternate embodiment, the 
Interface is designed to have more favorable polar interactions, for example by engineering the 
interface with more polar amino acids than that in the antibody template, by designing nonpolar 

10 residues with more optimized hydrogen bonds, electrostatic interactions, and the like. As will 
appreciated by those in the art, greater polar character at the interface may enable the 
bound/unbound equilibrium between Ig domains to be more reversible. In the unbound state, the 
residues which make up the interface with the other Ig domain and are normally sequestered from 
solvent become exposed to solvent. Nonpolar residues have a higher tendency to aggregate than 

1 5 polar residues, and therefore greater nonpolar character at the interdomain interface may result in a 
greater tendency to aggregate in the unbound form, resulting in non-reversibility of the 
unbinding/binding transition. Irreversible aggregation means that the antibody cannot get back to its 
native bound state (i.e. the Ig domain interface is not reformed). This property of Ig domain interfaces 
in antibodies is supported experimentally (Worn & PIQckthun, 2001, J. MoL Biol. 305:989-1010; Ewert 

20 et al. t 2002, Biochemistry, 41:3628-3636). In an alternate embodiment, the interface is engineered 
with more favorable nonpolar and polar interactions. 

A number of examples are provided below in which describe the use of computational screening 
methods to stabilize the interfaces between Ig domains. These examples illustrate how a variety of 
25 interactions may be designed at interdomain interfaces that result in greater stability. These 

■ 

examples are not meant to constrain the present invention to any particular application or theory of 
operation. Rather, the present invention relates as novel not only these provided individual examples, 
but the general use of computational screening methodology to design more energetically favorable 
inter-lg domain interactions in order to stabilize an antibody. 

30 

Solubility Optimization 

There is frequently a need to enhance the solubility of an antibody. Lower solubility of an antibody 
may result in a greater fraction of nonfunctional species, increased susceptibility to degradation, and 
shorter in vivo half-life and lower efficacy if the antibody is a therapeutic. Poor solubility may also 
35 place severe constraints on antibody formulation and route of administration. A number of design 
strategies are suggested for using computational screening methods to enhance the solubility of an 
antibody, all of which are embodiments of the present invention. 

In one embodiment, surface exposed nonpolar residues in an antibody are replaced with polar 
40 residues which are predicted by computational screening calculations to be favorable. Underlying this 
strategy is the principle that polar residues are more soluble than nonpolar ones. This principle is well 
known in the art. In regard to which residues are more polar or nonpolar than others, such a 
judgment may be made subjectively or objectively. Subjectively, for example, one skilled in the art of 
protein structural biology appreciates qualitatively that amino acids such as leucine, tryptophan, and 
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5 methionine are more nonpolar, and thus potentially more prone to cause aggregation when exposed 
to solvent, than amino acids such as serine, asparagine, and glutamate. Objective and quantitative 
measurements of hydrophobicity are also known in the art. For example, the free energies of transfer 
of an amino acid from non-aqueous to aqueous solution have been used to generate relative rankings 
of amino acid hydrophobicity, and such methods find use in the present invention. Variable positions 
10 and amino acids considered at those positions may be chosen using any variety of approaches, as 
described above, including but not limited to approaches based on solvent exposure, approaches 
which are hypothesis-driven, approaches which utilize experimental information, approaches which 
utilize sequence information, or any combination of these and other approaches. 

15 A number of strategies for replacing exposed nonpolar amino acids find use in the present invention. 
In one embodiment, residues which may be replaced Include residues which are exposed to solvent 
on individual Ig domains, or which lie at the interface between Ig domains. In this regard, all Ig 
domains of a human antibody, including V Ht V Ll Cy1, C L , Cy2, and Cy3, as well as the linkers and/or 
hinges which connect them, have surface residues which could be replaced with amino acids which 

20 may impart greater solubility to the antibody. In another embodiment, variable positions reside in a 
region of an antibody fragment which in the context of a full-length antibody or larger antibody 
fragment makes up the interface with another Ig domain. As will be appreciated by those skilled in the 
art, antibody fragments are generated by removing certain regions or domains of an antibody. As a 
result, regions of an Ig domain which interact with another Ig domain in the larger antibody may 

25 become exposed to solvent in the context of an antibody fragment. For example, the Vh and V L 

residues which make up the Vn/Cyl and V L /C L interfaces of an antibody are exposed to solvent in an 
scFv fragment of that antibody (Nieba et a/., 1997, Protein Eng. 10:435-44). The result for an scFv, or 
any other antibody fragment, may be increased propensity for aggregation and thus lower solubility. 
Computationar screening methods may be used to engineer mutations at these positions which result 

30 in greater solubility of the antibody fragment. 

Several additional strategies may also be used to optimize solubility. For example, it is known in the 
art that protein solubility is typically lowest when the pH of the solution is equal to the isoelectric point 
(pi) of the protein. Under such conditions, the net charge of the protein is equal to zero. It is possible 

35 to optimize solubility by altering the number and location of ionizable residues in the antibody to adjust 
the pi. In other cases, improvements in solubility may result from optimizing the stability of the 
antibody, as discussed above. As is well known in the art, proteins are much more prone to 
aggregation in unfolded or partially folded states. Thus proteins that are well folded, structured, 
and/or stable are typically more soluble. Accordingly, computational screening which stabilizes an 

40 antibody, for example by one or more design strategies discussed above, may also be used to 

enhance antibody solubility. Additionally, if the antibody contains one or more cysteines that do not 
form disulfide bonds in the native antibody structure, replacing such cysteines with less reactive, 
structurally compatible residues can prevent the formation of unwanted intra- and inter-molecular 
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5 disulfide bonds. As will be appreciated by those skilled in the art, additional strategies could also be 
used to optimize the solubility of antibodies. 

Affinity Maturation 

There is frequently a need to enhance the affinity of an antibody for its antigen. This process is 
10 referred to as affinity maturation, and following this process, the antibody may then be said to be 

affinity matured. The binding affinity of an antibody for its target is a critical parameter for its success 
as a therapeutic, diagnostic, or reagent. Higher affinity for antigen may result in a more efficacious 
antibody therapeutic. As discussed above, enhancement of antigen affinity is frequently wanted or 
needed for a variety of forms and sources of antibodies such as those that are substantially human, 
1 5 nonhuman, chimeric, or humanized. A particular case which demands affinity maturation is 

subsequent to humanization. As discussed above, this technique to reduce the immunogenicity of 
antibody therapeutics often results in loss of binding affinity for antigen, and thus regaining this affinity 
is typically desired. 

20 Computational screening methods may be applied to antibody affinity maturation using a number of 
design strategies, all of which are embodiments of the present invention. Strategies for affinity 
maturation include but are not limited to those which use only a structure or structures of bound 
antibody/antigen complexes, only a structure or structures of unbound antibodies, or structures of 
both bound and unbound antibody. These strategies need not be defined by the structural information 

25 that is available, but rather may be defined by the structural information that is employed. For 
example, to affinity mature an antibody it may be useful to carry out design calculations on an 
unbound antibody template that is a structure of the antibody alone without antigen, even though a 
structure of the antibody/antigen complex may be available. The structure of the unbound antibody 
may be available, or could be obtained by deleting antigen coordinates from the structure of the 

30 complex. 

As discussed above, antibody templates may be obtained from a variety of sources, including but not 
limited to X-ray crystallographic techniques, NMR techniques, de novo modeling, and homology 
modeling. Antibody/antigen complexes may furthermore be obtained using docking methods. For 

35 example, if the antibody/antigen complex structure is not available, it may be modeled by docking the 
antigen into the antibody variable region. Methods for this process are known in the art. Variable 
positions and amino acids considered at those positions may be chosen using any variety of 
approaches, as described above, including but not limited to approaches based on solvent exposure, 
approaches which are hypothesis-driven, approaches which utilize experimental information, 

40 approaches which utilize sequence information, or any combination of these and/or other approaches. 

In one embodiment, computational screening is used to affinity mature an antibody by using the 
structure of a bound antibody/antigen complex as the template for design calculations. In this 
strategy, one or more antibody mutations are design that result in more favorable interactions (i.e., 
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5 higher affinity) between the antibody and its antigen. In one embodiment, only antibody residues 
which directly contact antigen, referred to herein as "contact residues" are allowed to vary in design 
calculations. In an alternate embodiment, variable antibody positions may include residues which do 
not contact antigen, alone or in addition to residues which do contact antigen. For example, the 
variable positions in a design calculation could be set to those residues which interact with contact 

10 residues, but are not themselves contact residues. As will be appreciated by those skilled in the art, 
the subtle conformations of contact residues which are optimal for antigen binding are determined in 
part by the conformations of the surrounding residues. By using computational screening to explore 
substitutions in the shell of residues which interact with contact residues, a quality diversity of new 
contact residue conformations may be sampled. In an alternate embodiment, contact residues and 

1 5 residues which are not contact residues are variable positions in design calculations. 

In another embodiment, computational screening is used to affinity mature an antibody by using the 
structure of an uncomplexed antibody structure, i.e. a structure of an antibody which is not bound to 
its antigen, as the template for design calculations. In this strategy, antibody residues which contact 

20 antigen or which are believed to contact antigen are mutated to residues which are energetically 
favorable in the context of the structural template. The primary goal of this approach is to generate 
quality diversity within an experimental library such that the distribution within the library is skewed 
towards a larger percentage of variants which are energetically compatible with the antibody than 
would be expected if variants were designed randomly. Although the antibody variants in this library 

25 are not directly computationally screened to possess higher affinity for antigen, such variants will likely 
still be present in the library. The use of computational screening enables the vast sequence space of 
mutations which are inconsistent with the antibody structure to be trimmed from the library, thereby 
increasing the chances of finding in an experimental screen those variants which possess higher 
antigen binding affinity. In the absence of an antibody/antigen complex structure, it is not possible to 

30 identify contact residues by visual inspection. Thus, experimental and sequence information are 
particularly useful in this case, as these may provide insight into which residues are important 
determinants of antigen binding. 

In another embodiment, computational screening methods are used to affinity mature an antibody by 
35 combining results from design calculations which use the structures of both a bound antibody/antigen 
complex and an unbound antibody structure as templates for design calculations. In one 
embodiment, computational screening is used to engineer mutations at or near the antibody/antigen 
interface that are energetically favorable in the context of both the bound and unbound antibody 
structures. For this strategy, output from two sets of design calculations could be used to generate an 
40 experimental library. For example, one set of calculations could involve those which use one or more 
unbound antibody structures as the template(s), and another set of calculations could use one or 
more bound antibody/antigen structures as the template(s). The experimental library could be 
comprised of variants which are predicted to be energetically favorable in both sets of calculations. In 
one embodiment, variants which are predicted to be energetically favorable in both structures are 
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5 included in the library. In an alternate embodiment, variants which are predicted to be energetically 
favorable in at least one of the structures are included in the library. As is illustrated in the examples 
below, it is a preferred embodiment to have at least one of the variable regions located In a framework 
region, a complementarity determining region or a combination of both regions. 

10 A number of examples are provided below which describe the use of computational screening to 
affinity mature antibodies. These examples are not meant to constrain the present invention to any 
particular application or theory of operation. Rather, the present invention relates as novel not only 
these provided individual examples, but the general use of computational screening methods to 
affinity mature antibodies. 

15 

EXAMPLES 



A number of examples are provided below to illustrate implementation of the design strategies 
discussed above to optimize antibodies. These examples employ a variety of strategies, approaches, 

20 methods, and so forth to choose variable positions, choose amino considered at those positions, 

calculate energies, search sequence space using optimization algorithms, and generate experimental 
libraries. Libraries generated from these examples could be subsequently screened experimentally to 
obtain optimized antibody variants, become part of other libraries which could be subsequently 
screened experimentally, or serve other purposes. These examples are not meant to constrain the 

25 present invention to any particular application or theory of operation. Rather, the present invention 
relates as novel not only to these provided individual examples, but the general use of computational 
screening to enhance antibody stability, improve antibody solubility, and increase the affinity of 
antibodies for antigen. 

30 Figure 3 shows a list of the antibody structures which are used as templates in the provided 

examplesr Unless otherwise noted, the groups of core, surface, and boundary for choice of amino 
acids considered at variable positions are composed of the following sets of amino acids: core = 
alanine, valine, isoleucine, leucine, phenylalanine, tyrosine, tryptophan, and methionine; surface = 
alanine, serine, threonine, aspartic acid, asparagine, glutamine, glutamic acid, arginine, lysine and 

35 histidine; boundary = alanine, serine, threonine, aspartic acid, asparagine, glutamine, glutamic acid, 
arginine, lysine, histidine, valine, isoleucine, leucine, phenylalanine, tyrosine, tryptophan, and 
methionine; All or All 20 = all 20 natural amino acids. 



Stability Optimization 

40 As discussed above, two main strategies for utilizing computational screening methodology to 
stabilize antibodies are to enhance the stability of individual Ig domains, and enhance interface 
stability between individual Ig domains. 
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5 Domain Stability 

The stability of an antibody can be Increased by designing more favorable Interactions within one or 
more individual Ig domains. For an antibody which is made up of more than one Ig domain, each 
individual Ig domain can be engineered for greater stability. Thus for example, for a human, mouse, 
rat, or rabbit antibody, stability can be improved by stabilizing one or more of domains Vh, V l , Cy1, C L , 
10 Cy2, and Cy3. 

Example 1: Campath V H Domain Stabilization 

The heavy chain variable domain (V H ) of Campath was stabilized using computational screening 
methods to design more favorable interactions within the interior of the protein. Campath is a 

15 humanized antibody that is currently marketed for treatment for B-cell chronic lymphocytic 

leukemia. The high resolution structure is available of the complex of the Campath Fab with its target 
antigen, a peptide from the cell surface protein CD52. This structure, PDB accession code 1CE1, 
served as the template for design calculations. The V H domain of Campath, and most antibodies, has 
an extensive interior which is critical to its stability. This interior can be thought of as being made up 

20 of two separate hydrophobic cores which are separated by the central disfulfide bond. These cores 
are referred to as the upper core and lower core, with the directional distinction being defined when 
the CDRs are facing upward as shown in Figure 4. As will be appreciated by those skilled in the art, 
packing interactions between the hydrophobic residues which make up these cores play a key role in 
V H stability, and thus in the stability of any antibody to which V H belongs. Computational screening 

25 was applied to design more stable packing interactions in the V H lower core. Variable positions were 
chosen by visual inspection of the 1CE1 structure, and these positions are shown in Figure 4 and 
listed in Figure 5a. Because these positions are almost completely sequestered from solvent, the 
amino acids considered were chosen as the set belonging to the core classification. The 
conformations "of amino acids at variable positions were represented as a set of backbone- 

30 independent side chain rotamers derived from the rotamer library of Dunbrack & Cohen (Dunbrack & 
Cohen, 1997, Protein Science 6:1661-1681). 

The energies of all possible combinations of the considered amino acids at the chosen variable 
positions were calculated using a force field containing terms describing van der Waals, solvation, 

35 electrostatic, and hydrogen bond interactions, and the optimal (ground state) sequence was 

determined using a DEE algorithm. This ground state, and the WT Campath sequence, are shown in 
Figure 5a. The fact that the ground state is very similar to the WT sequence validates the 
computational screening method. As will be appreciated by those in the art, the predicted lowest 
energy sequence is not necessarily the true lowest energy sequence because of errors, primarily in 

40 the scoring function, coupled with the fact that subtle conformational differences in proteins can result 
in dramatic differences in stability. However, the predicted ground state sequence is likely to be close 
to the true ground state, and thus this problem can be hedged by screening variants close in 
sequence space and in energy around the predicted ground state. Towards this goal, in order to 
generate a diversity of sequences for an experimental library, a Monte Carlo algorithm was used to 

57 



WO 03/074679 



PCT/US03/06598 



5 evaluate the energies of 1000 similar sequences around the predicted ground state. Figure 5a shows 
the output sequence lists from this Monte Carlo search. 

These results can be used to generate one or more experimental libraries which can be screened for 
increased antibody stability. As discussed above, there are a variety of ways to generate an 

10 experimental library. Library 1, shown in Figure 5b is a defined library of just the ground state 

sequence. Library 2, shown in Figure 5c, is a combinatorial library in which a 1% cutoff of occupancy 
has been applied to the Monte Carlo output, that is to say that only amino acid substitutions which 
occur in 10 or greater variants out of the 1 000 Monte Carlo output sequences are included in the 
library. Because valine does not occur at heavy chain position 1 1 7 in the Monte Carlo output, the WT 

1 5 sequence is not represented. It may be judicious to include this valine at 1 1 7 H so that the WT amino 
acids are represent combinatorially in library 2. The combination of all of these substitutions with all 
other substitutions results in a combinatorial complexity of 864, i.e. there are 864 possible variants in 
the library. 

* 

20 Example 2: Campath V H Domain Stabilization 

The light chain variable domain (V L ) of Campath was also stabilized by using computational screening 
methods. Like the Vh domain, has an extensive interior which can be thought of as being made up 
of an upper and lower core, separated by the central disfulfide bond, shown in Figure 6. 
Computational screening was applied to design more stable packing interactions in the V L upper core. 

25 Stabilization of the upper core may be less straightforward than the lower core because subtle 

conformational changes to the upper may more directly impact the conformation of the CDRs, and 
thus mutations may affect antigen binding. Variable positions were chosen by visual inspection of the 
1 CE1 structure, and these positions are shown in Figure 6 and listed in Figure 7a. For most variable 
positions, the amino acids conserved were chosen as the set belonging to the core classification 

30 because they are sequestered from solvent. Substitutions at two light chain positions, 92 and 97, 

could potentially make favorable polar Interactions, and so amino acids considered for these positions 
were chosen as the set belonging to the boundary classification. The conformations of amino acids at 
variable-positions were represented as a set of side chain rotamers derived'from a backbone- 
independent rotamer library. 

35 

The CE1 structure was used as the template for design calculations. The energies of all possible 
combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 
interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 
40 ground state, and the WT Campath sequence, are shown in Figure 7a. The fact that the WT 
sequence is predicted to be the ground state validates the computational screening method. A 
diversity of sequences for an experimental library was generated by using a Monte Carlo algorithm to 
evaluate the energies of 1000 similar sequences around the predicted ground state. Figure 7a shows 
the output sequence lists from this Monte Carlo search. 
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5 These results can be used to generate one or more experimental libraries which can be subsequently 
screened for increased antibody stability. An experimental library, shown in Figure 7b, was derived 
from this set of designed calculations by applying a 5% cutoff of occupancy to the Monte Carlo output, 
i.e. only amino acid substitutions which occur in 50 or greater variants out of the 1000 Monte Carlo 
output sequences are included in the library. This combinatorial library has a complexity of 448. 

10 

Example 3: Campath Cyl Domain Stabilization 

The heavy chain constant domain 1 (Cy1) is also important to antibody stability. This domain is a part 
of the antibody constant region, and thus improvements made are widely applicable to antibodies, 
independent of what antigen is bound at the variable region. The Cy1 of Campath was stabilized 

15 using computational screening methods to design more favorable interactions within the interior of the 
protein. Like most immunoglobulin domains, Cy1 has an extensive interior made up of an upper and 
lower core, separated by the central disfulfide bond, shown in Figure 8. Computational screening was 
applied to design more stable packing interaction in the Cy1 upper core. Variable positions were 
chosen by visual inspection of the 1CE1 structure, and these positions are shown in Figure 8 and 

20 listed in Figure 9a. The majority of the chosen core variable positions are sequestered from solvent, 
and therefore the amino acids conserved were chosen as the set belonging to the core classification. 
The exception is heavy chain position 173, substitutions at which could potentially make favorable 
poiar interactions, and so amino acids considered for this position were chosen as the set belonging 
to the boundary classification. The conformations of amino acids at variable positions were 

25 represented as a set of side chain rotamers derived from a backbone-independent rotamer library. 
The CE1 structure was used as the template for design calculations. The energies of all possible 
combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 
interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 

30 ground state, and the WT Campath sequence, are shown in Figure 9a. The fact that the predicted 
ground state sequence is very similar to the WT sequence validates the computational screening 
method. A diversity of sequences for an experimental library was generated by using a Monte Cario 
algorithm to evaluate the energies of 1000 similar sequences around the predicted ground state. 
Figure 9a shows the output sequence \\s\s from this Monte Carlo search. 

35 

These results can be used to generate one or more experimental libraries which can be subsequently 
screened for increased antibody stability. An experimental library, shown in Figure 9b, was derived 
from this set of designed calculations by applying a 5% cutoff of occupancy to the Monte Carlo output, 
i.e. only amino acid substitutions which occur in 50 or greater variants out of the 1000 Monte Carlo 
40 output sequences are included in the library. This combinatorial library has a complexity of 192. 

Example 4: Fc Cy2 Domain Stabilization 

The heavy chain constant domain 2 (Cy2) is also important to antibody stability. This domain is part 
of the antibody Fc region, and thus improvements made are widely applicable to antibodies, 
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5 independent of what antigen is bound at the variable region. The Fc Cy2 domain was stabilized using 
computational screening methods to design more favorable interactions within the interior of the 
protein. The high resolution structure of human Fc has been solved. This structure, PDB accession 
code 1DN2, served as the template for design calculations. Like most immunoglobulin domains, Cy2 
has an extensive interior made up of an upper and lower core, separated by the central disfulfide 

10 bond, shown in Figure 10. Computational screening was applied to design more stable packing 

interactions in the Cy2 upper core. Variable positions were chosen by visual inspection of the 1DN2 
structure, and these positions are shown in Figure 10 and listed in Figure 11a. The majority of the 
chosen core variable positions are sequestered from solvent, and therefore the amino acids 
conserved were chosen as the set belonging to the core classification. The exception is position 332, 

15 substitutions at which could potentially make favorable polar interactions, and so amino acids 
considered for this position were chosen as the set belonging to the boundary classification. The 
conformations of amino acids at variable positions were represented as a set of side chain rotamers 
derived from a backbone-independent rotamer library. 

20 The energies of ail possible combinations of the considered amino acids at the chosen variable 
positions were calculated using a force field containing terms describing van der Waals, solvation, 
electrostatic, and hydrogen bond interactions, and the optimal (ground state) sequence was 
determined using a DEE algorithm. This ground state, and the WT Fc sequence, are shown in Figure 
1 1a. The fact that the predicted ground state sequence is very similar to the WT sequence validates 

25 the computational screening method. A diversity of sequences for an experimental library was 
generated by using a Monte Carlo algorithm to evaluate the energies of 1000 similar sequences 
around the predicted ground state. Figure 11a shows the output sequence lists from this Monte Carlo 
search. 

30 These results can be used to generate one or more experimental libraries which can be screened for 
increased antibody stability. An experimental library, shown in Figure 11b, was derived directly from 
this set of designed calculations, i.e. no cutoff criteria were applied. This combinatorial library has a 
complexity of 336. 

35 Example 5: Fc C/3 Domain Stabilization 

The heavy chain constant domain 3 (Cy3) is also important to antibody stability. This domain is part 
of the antibody Fc region, and thus improvements made are widely applicable to antibodies, 
independent of what antigen is bound at the variable region. The Fc Cy3 domain was stabilized by 
using computational screening methods to design more favorable interactions within the interior of the 

40 protein. Like most immunoglobulin domains, Cy2 has an extensive interior made up of an upper and 
lower core, separated by the central disfulfide bond, shown in Figure 12. Computational screening 
was applied to design more stable packing interaction in the Cy3 lower core. Variable positions were 
chosen by visual inspection of the 1DN2 structure, and these positions are shown in Figure 12 and 
listed in Figure 13a. The majority of the chosen core variable positions are sequestered from solvent, 
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5 and therefore the amino acids conserved were chosen as the set belonging to the core classification. 
The exceptions are positions 358 and 391 , substitutions at which could potentially make favorable 
polar interactions, and so amino acids considered for these positions were chosen as the set 
belonging to the boundary classification. The conformations of amino acids at variable positions were 
represented as a set of side chain rotamers derived from a backbone-independent rotamer library. 

1 0 1 DN2 was used as the structural template for design calculations. The energies of all possible 

combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 
interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 
ground state, and the WT Fc sequence, are shown in Figure 13a. The fact that the predicted ground 

1 5 state sequence is very similar to the WT sequence validates the computational screening technology. 
A diversity of sequences for an experimental library was generated by using a Monte Carlo algorithm 
to evaluate the energies of 1000 similar sequences around the predicted ground state. Figure 13a 
shows the output sequence lists from this Monte Carlo search. 

20 These results can be used to generate one or more experimental libraries which can be screened for 
increased antibody stability. An experimental library, shown in Figure 13b, was derived from this set 
of designed calculations by applying a 1% cutoff of occupancy to the Monte Carlo output, i.e. only 
amino acid substitutions which occur in 10 or greater variants out of the 1000 Monte Carlo output 
sequences are included in the library. This combinatorial library has a complexity of 336. 

25 

Interface Stability 

The stability of an antibody can be increased by designing more favorable interactions between 
individual Ig domains at inter-lg domain interfaces. For example, as can be seen in Figure 1 , for 
human antibodies there are five interdomain interfaces that can be optimized using computational 
30 screening methodology: VhA/ l , Cy1/C L , Vh/Cy1, V l /C l , and Cy3/Cy3. 

Example 6: rhumAb VEGF Vh/V l Interface Stabilization 

The stability of the interface between the V H and V L domains is critical to antibody stability. The 
antibody rhumAb VEGF was stabilized by enhancing the interaction between the V H and V L domains 

35 by using computational screening methods to design more favorable interactions between the 

residues which make up this interface. rhumAb VEGF is a humanized antibody that is currently in 
clinical development for treatment of a variety of cancers. The high resolution structure is available of 
the complex of the rhumAb VEGF Fab fragment with its target antigen, the vascular endothelial 
growth factor (VEGF). This structure, PDB accession code 1CZ8, served as the template for design 

40 calculations. The V H / V L interface of rhumAb VEGF is shown in Figure 14. Variable positions were 
chosen by visual inspection of the 1CZ8 structure, and these positions are shown in Figure 14 and 
listed in Figures 15a and 15b. For rhumAb VEGF, the interface can be separated into two somewhat 
independent sets of residues, and thus it was possible to carry out computational screening in two 
separate sets of design calculations. The sets of amino acids considered at variable positions were 
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5 chosen subjectively by visual inspection of the 1 CZ8 structure. The conformations of amino acids at 
variable positions were represented as a set of side chain rotamers derived from a backbone- 
Independent rotamer library. 

The 1CZ8 structure was used as the template for design calculations. For both sets of calculations, 
10 the energies of all possible combinations of the considered amino acids at the chosen variable 
positions were calculated using a force field containing terms describing van der Waals, solvation, 
electrostatic, and hydrogen bond interactions, and the optimal (ground state) sequences were 
determined using a DEE algorithm. These ground states, and the WT rhumAb VEGF sequence, are 
shown in Figures 1 5a and 1 5b. The fact that the predicted ground state sequences are very similar to 
1 5 the WT sequence validates the computational screening method. A diversity of sequences for an 

experimental library was generated by using a Monte Carlo algorithm to evaluate the energies of 1000 
similar sequences around the predicted ground states. Figures 1 5a and 1 5b show the output 
sequence lists from these Monte Carlo searches. 

20 These results can be used to generate one or more experimental libraries which can be screened for 
increased antibody stability. An experimental library, shown in Figures 15c, was derived by applying a 
1 % cutoff of occupancy to the Monte Carlo output from each set of calculations, and then these 
primary libraries were subsequently combined to generate a secondary library with mutations at all 
positions. This combinatorial library has a complexity of 1.3 x 10 7 . 

25 

Because of the number of residues involved in mediating this interface, it may be beneficial to reduce 
the complexity of the design calculations. As discussed above, sequence information can be used to 
guide the choice of variable positions and the set of amino acids considered at those positions. The 
use of sequence information here will enable the complexity of the computational problem to be • 

30 reduced while ensuring that the remaining diversity sampled is of high quality, in terms of the 

structural, functional, and immunogenic fidelity of the antibody. Figures 16a and 16b show the 1CZ8 
heavy and light chain variable chain sequences aligned with the human Vh and Vl kappa germ line 
sequences. A new design calculation using this information was run to stabilize the V^/ V u interface. 
The sequence information was first used to reevaluate the list of variable positions. A subset of the 

35 positions in Figures 1 5a and 1 5b were chosen based on the degree of variability at each position in 
the germ line. Those positions with one type of amino acid in the majority of the sequences, or for 
which there is no sequence information, were not allowed to vary in the calculation. This new set is 
shown in Figure 17a. Light chain position 98 and heavy chain positions 45, 110, and 113 were not 
variable positions in this calculation, but were floated. The sequence information was also used to 

40 choose the set of amino acids to be considered at variable positions in the new design calculation. All 
amino acids, and only those amino acids, which appear at each variable position in the germ line were 
considered in the new design calculation. For variable positions in the light and heavy chain CDR3s, 
for which no sequence information is available, all 20 amino acids were considered. This set of 
considered amino acids is shown in Figure 17a. 
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The 1CZ8 structure was used as the template for design calculations. In this new calculation, 
energies of all possible combinations were not precalculated. Instead, a genetic algorithm was used 
to screen for low energy sequences, with energies being calculated during each round of "evolution" 
only for those sequences being sampled. The conformations of amino acids at variable and floated 

10 positions were represented as a set of side chain rotamers derived from a backbone-independent 
rotamer library using a flexible rotamer model (Mendes et a/., 1999, Proteins: Structure, Function, 
and Genetics 37:530-543). Energies were calculated using a force field containing terms describing 
van der Waals, solvation, electrostatic, and hydrogen bond interactions. This calculation generated a 
list of 300 sequences which are predicted to be low in energy. Clustering was performed to facilitate 

15 analysis of the results and library generation. The 300 output sequences were clustered 
computationally into 10 groups of similar sequences using a nearest neighbor single linkage 
hierarchical clustering algorithm to assign sequences to related groups based on similarity scores 
(Diamond, R., Coordinate-Based Cluster Analysis, Acta Cryst. 1995 , D51, 127-135.). That Is, all 
sequences within a group are most similar to all other sequences within the same group and less 

20 similar to sequences in other groups. The lowest energy sequence from each of these ten clusters, 
used here as a representative of each group, is presented in Figure 17a. 

These results can be used to generate one or more experimental libraries which can be subsequently 
screened for increased antibody stability. An experimental library can be derived directly from the 

25 representative cluster group sequences. Thus Figure 17a provides a 10 sequence experimental 
library. To efficiently use experimental resources, this library size of 10 variants could be screened 
first, followed by subsequent screening of sequences or a subset of sequences within the group to 
which the experimentally determined most favorable variant belongs. For example, if variant 5 (I.e. 
the lowest energy sequences from cluster group 5) was found to be most favorable, all of the 

30 sequences of cluster group 5 could be subsequently screened. The 14 sequences in group 5 are 
presented in Figure 17b as an example of such an experimental library. 



Example 7: Herceptin VhA/u interface Stabilization 

The interface between the Vh and Vl domains of the antibody Herceptin was also stabilized. More 
35 favorable interactions between the V H and V L domains were designed using computational screening 
methods. Herceptin, which targets the extracellular domain of the proto-oncogene Her2/neu gene 
product, also known as erbB2, is a humanized antibody that is currently marketed for treatment for 
breast cancer. The high resolution structure is available of uncomplexed Herceptin scFv. This 
structure, PDB accession code 1 FVC, served as the template for design calculations. The Vh/ V l 
40 interface of Herceptin is shown in Figure 18. Variable positions were chosen by visual inspection of 
the 1FVC structure, and these positions are shown in Figure 18 and listed in Figure 19a. The majority 
of the chosen core variable positions are sequestered from solvent, and therefore the amino acids 
conserved were chosen as the set belonging to the core classification. The exception is light chain 
position 43, substitutions at which could potentially make favorable polar interactions, and so amino 
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5 acids considered for this position were chosen as the set belonging to the boundary classification. 
The conformations of amino acids at variable positions were represented as a set of side chain 
rotamers derived from a backbone-independent rotamer library. 

The 1 FVC structure was used as the structural template for design calculations. The energies of all 

10 possible combinations of the considered amino acids at the chosen variable positions were calculated 
using a force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen 
bond interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. 
This ground state, and the WT Herceptin sequence, are shown in Figure 1 9a. The feet that the 
predicted ground state sequence is very similar to the WT sequence validates the computational 

1 5 screening technology. A diversity of sequences for an experimental library was generated by using a 
Monte Carlo algorithm to evaluate the energies of 1000 similar sequences around the predicted 
ground state. Figure 19a shows the output sequence list from this Monte Carlo search. 
These results can be used to generate one or more experimental libraries which can be subsequently 
screened for increased antibody stability. An experimental library, shown in Figure 19b, was derived 

20 by applying a 1% cutoff of occupancy to the Monte Carlo output from each set of calculations, i.e. only 
amino acid substitutions which occur in 10 or greater variants out of the 1000 Monte Carlo output 
sequences are included in the library. Additionally, the glutamine was added at light chain position 89 
so that the WT sequence is represented. This combinatorial library has a complexity of 5184. 
In the above calculation, for all but one variable position only nonpolar amino acids were considered. 

25 As discussed above, nonpolar residues have a higher tendency to aggregate than polar residues, and 
therefore nonpolar amino acids at the interdomain interface can result in a greater nonreversibility of 
the unbinding/binding transition. Design of a stable interface with greater polar character may thus 
provide greater thermodynamic reversibility and improved solubility. Another Herceptin VhA/ l 
• interface calculation was carried out in which the amino acids considered were chosen as the set 

30 belonging to the surface classification. A number of nonpolar interactions, however, appear critical to 
this interface, both by visual inspection and by their level of conservation in the aligned germ lines 
(Figures 2a and 2b). These positions, including light chain positions 36 and 89, and heavy chain 
positions 95 and 1 10, were floated in the new calculation. The remaining set of variable positions is 
shown in Figure 19c. 

35 

The 1 FVC structure was used as the template for design calculations. The energies of all possible 
combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 
interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 
40 ground state, and the WT Herceptin sequence, are shown in Figure 19c. The fact that the predicted 
ground state sequence is very similar to the WT sequence validates the computational screening 
technology. A diversity of sequences for an experimental library was generated by using a Monte 
Carlo algorithm to evaluate the energies of 1000 similar sequences around the predicted ground 
state. Figure 19c shows the output sequence list from this Monte Carlo search. 
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These results can be used to generate one or more experimental libraries which can be screened for 
increased antibody stability. An experimental library, shown in Figure 19d, was derived by applying a 
5% cutoff of occupancy to the Monte Carlo output from each set of calculations, i.e. only amino acid 
substitutions which occur in 50 or greater variants out of the 1000 Monte Carlo output sequences are 
10 included In the library. Additionally, the WT residues were added to the library so that the sequence 
space sampled experimentally also includes interfaces made up of favorable polar and nonpolar 
residues at these positions. This combinatorial library has a complexity of 4032. 

Example 8: rhumAb VEGF C L /Cy1 Interface Stabilization 

15 The interface between the Q_ and Cy1 domains can also be stabilized using computational screening. 
More favorable interactions were designed between residues which make up the rhumAb VEGF 
C L /Cy1 interface. The C L /Cy1 interface of rhumAb VEGF is shown in Figure E8. Variable positions 
were chosen by visual Inspection of the 1 CZ8 structure, and these positions are shown in Figure 20 
and listed in Figure 21a. Because these positions are almost completely sequestered from solvent, 

20 the amino acids considered were chosen as the set belonging to the core classification, even for 176, 
178, and 189 which are polar amino acids in the WT sequence. The WT amino acids were, however, 
also considered at these positions. The conformations of amino acids at variable positions were 
represented as a set of side chain rotamers derived from a backbone-independent rotamer library. 
The 1CZ8 structure was used as the template for design calculations. The energies of all possible 

25 combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 
interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 
ground state, and the WT rhumAb VEGF sequence, are shown in Figure 21a. The fact that the 
predicted ground state sequence is very similar to the WT sequence validates the computational 

30 screening method. A diversity of sequences for an experimental library was generated by using a 
Monte Carlo algorithm to evaluate the energies of 1000 similar sequences around the predicted 
ground state. Figure 21a shows the output sequence list from this Monte Carlo search. 
These results can be used to generate one or more experimental libraries which can be subsequently 
screened for increased antibody stability. An experimental library, shown in Figure 21b, was derived 

35 by applying a 5% cutoff of occupancy to the Monte Carlo output from each set of calculations, i.e. only 
amino acid substitutions which occur in 50 or greater variants out of the 1000 Monte Carlo output 
sequences are included in the library. Three additional amino acids were added to this library: 
threonine and serine were added to light chain position 178 and heavy chain position 189 respectively 
so that all polar residues are represented in the library, and the valine at light chain position 178 was 

40 also included even though it did not make the 5% cutoff. As is known in the art, valine is a good 
nonpolar substitution for threonine because the two have nearly identical size and shape. This 
combinatorial library has a complexity of 5184. 
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5 Example 9: Fc Cy3ICy3 Interface Stabilization 

The interface between the Cy3 domains can also be stabilized using computational screening. Again, 
because this domain is a part of the antibody Fc region, improvements made are widely applicable to 
antibodies, independent of what antigen is bound at the variable region. More favorable interactions 
were designed between residues which make up the Fc Cy3/Cy3 interface. Variable positions were 

10 chosen by visual inspection of the 1DN2 structure, and these positions are shown in Figure 22 and 
listed in Figure 23a. Because these positions are almost completely sequestered from solvent, the 
amino acids considered were chosen as the set belonging to the core classification, although the WT 
amino acid was included at each position. The conformations of amino acids at variable positions 
were represented as a set of side chain rotamers derived from a backbone-independent rotamer 

15 library. 

The 1 DN2 structure was used as the template for design calculations. The energies of all possible 
combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 

20 interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 
ground state, and the WT Fc sequence, are shown in Figure 23a. The fact that the predicted ground 
state sequence is very similar to the WT sequence validates the computational screening method. A 
diversity of sequences for an experimental iibrary was generated by using a Monte Carlo algorithm to 
evaluate the energies of 1000 similar sequences around the predicted ground state. Figure 23a 

25 shows the output sequence list from this Monte Carlo search. 

These results can be used to generate one or more experimental libraries which can be subsequently 
screened for increased antibody stability. An experimental library, shown in Figure 23b, was derived 
by applying a 5% cutoff of occupancy to the Monte Carlo output from each set of calculations, i.e. only 
30 amino acid substitutions which occur in 50 or greater variants out of the 1 000 Monte Carlo output 
sequences are included in the library. This combinatorial library has a complexity of 1800. 

Solubility Optimization 

As discussed above, computational screening methods can be used to optimize the solubility of 
35 antibodies by designing favorable, more soluble substitutions at surface exposed nonpolar residues. 
Residues which can be replaced include residues which are exposed to solvent on Individual Ig 
domains, including V H> V L , Cy1, C L , Cy2, and Cy3 as well as the linkers and/or hinges that connect 
them, or which lie at the interface between Ig domains. 

40 Example 10: Campath Solubility Optimization 

All four Ig domains of the Campath Fab antibody fragment were optimized for greater solubility using 
computational screening. Computational screening was applied to evaluate the replacement of all 
exposed nonpolar residues on these domains, including V H , V L , Cy1, C Li with all 20 amino acids. 
Variable positions were chosen by visual inspection of the 1CE1 structure, and include exposed 
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5 nonpolar residues which are not involved in binding antigen. These positions are shown in Figure 24 
and listed in Figure 25a. Each of the 20 amino acids was considered at each variable position. 
The 1CE1 structure was used as the template for design calculations. For each variable position, 
each of the 20 amino acids was substituted and allowed to sample rotamer conformations derived 
from a backbone-independent rotamer library using a flexible rotamer model. A genetic algorithm was 

1 0 used to optimize the conformation of each amino acid substitution at each variable position, with 
energies being calculated during each round of evolution. In this way, the lowest energy rotamer of 
each substitution was determined, and this energy was defined as the energy of substitution for that 
amino acid at that variable position. Thus this design calculation provided an energy of substitution 
for each of the 20 amino acids at each variable position. Figure 25a shows these results. At each 

1 5 variable position, the lowest energy substitution and all amino acid substitutions which are within 1 
unit of energy of the lowest energy substitution are shown. Thus Figure 25a presents the most 
favorable substitutions for each of the variable positions. 

These results can be used to generate one or more experimental libraries which can be subsequently 
20 screened for improved antibody solubility. An experimental library was derived from this 

computational screening output by including the WT amino acid and all favorable polar amino acid 
substitutions at each variable position. As can be seen, no polar substitutions are predicted to be 
favorable for heavy chain position 116, and so this position is left as the WT leucine in the library. 
This experimental library, which has a combinatorial complexity of 1 1200, is shown in Figure 25b. 

25 

Example 1 1: rhumAb VEGF Solubility Optimization 

All four Ig domains of the rhumAb VEGF Fab antibody fragment were optimized for greater solubility 
using computational screening. Computational screening was applied to evaluate the replacement of 
all exposed nonpolar residues on these domains, including V H , V L , Cy1, C L , with all 20 amino acids. 

30 Variable positions were chosen by visual inspection of the 1CZ8 structure, and include exposed 

nonpolar residues which are not involved in binding antigen. These positions are shown in Figure 26 
and listed in Figure 27a. Each of the 20 amino acids was considered at each variable position. 
The 1CZ8 structure was used as the template for design calculations. For each variable position, 
each of the 20 amino acids was substituted and allowed to sample rotamer conformations derived 

35 from a backbone-independent rotamer library using a flexible rotamer model. A genetic algorithm was 
used to optimize the conformation of each amino acid substitution at each variable position, with 
energies being calculated during each round of evolution using a force field containing terms 
describing van der Waals, solvation, electrostatic, and hydrogen bond interactions. In this way, the 
lowest energy rotamer of each substitution was determined. This energy was defined as the energy of 

40 substitution for that amino acid at that variable position. Thus this design calculation provided an 
energy of substitution for each of the 20 amino acids at each variable position. Figure 27a shows 
these results. At each variable position, the lowest energy substitution and ail amino acid 
substitutions which are within 1 unit of energy of the lowest energy substitution are shown. Thus 
Figure 27a presents the most favorable substitutions for each of the variable positions. 



67 



WO 03/074679 



PCT/US03/06598 



5 

These results can be used to generate one or more experimental libraries which can be subsequently 
screened for improved antibody solubility. An experimental library was derived from this 
computational screening output by including the WT amino acid and all favorable polar amino acid 
substitutions at each variable position. As can be seen, no polar substitutions are predicted to be 
10 favorable for light chain positions 15 and 125 and heavy chain positions 80, 118, and 169, and so 
these positions are left as the nonpolar WT amino acids in the library. This experimental library, 
which has a combinatorial complexity of 61440, is shown in Figure 27b. 

Example 12: Herceptin Solubility Optimization 

15 As discussed above, by removing certain regions or domains of an antibody to generate an antibody 
fragment, nonpolar residues that make up the interface with another Ig domain in the context of a full- 
length antibody or larger antibody fragment can become exposed. For example, for Herceptin, the V H 
and V L residues which make up the Vn/Cyl and VjCi interfaces are exposed to solvent in an scFv 
fragment, as is seen in the 1 FVC structure. Computational screening was used to engineer favorable, 

20 more soluble mutations at these positions for Herceptin. Variable positions were chosen by visual 
inspection of the 1 FVC structure, and include the set of exposed nonpolar residues at the C-terminal 
end of the V H and V L domains. These positions are shown in Figure 28 and listed in Figure 29a. 
Each of the 20 amino acids was considered at each variable position. 

25 The 1 FVC structure was used as the template for design calculations. For each variable position, 
each of the 20 amino acids was substituted and allowed to sample rotamer conformations derived 
from a backbone-independent rotamer library using a flexible rotamer model. A genetic algorithm was 
used to optimize the conformation of each amino acid substitution at each variable position, with 
energies being calculated during each round of evolution using a force field containing terms 

30 describing van der Waals, solvation, electrostatic, and hydrogen bond interactions. In this way, the 
lowest energy rotamer of each substitution was determined, and this energy was defined as the 
energy of substitution for that amino acid at that variable position. Thus this design calculation 
provided an energy of substitution for each of the 20 amino acids at each variable position. Figure 
29a shows these results. At each variable position, the lowest energy substitution and all amino acid 

35 substitutions which are within 1 unit of energy of the lowest energy substitution are shown. Thus 
Figure 29a presents the most favorable substitutions for each of the variable positions. 

These results can be used to generate one or more experimental libraries which can be subsequently 
screened for improved antibody solubility. An experimental library was derived from this 
40 computational screening output by including the WT amino acid and all favorable polar amino acid 
substitutions at each variable position. As can be seen, no polar substitutions are predicted to be 
favorable for light chain position 83, and so this position is left as the nonpolar WT phenylalanine in 
the library. This experimental library, which has a combinatorial complexity of 2530, is shown in 
Figure 29b. 
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Example 13: Fc Solubility Optimization 

The Fc region was optimized for greater solubility using computational screening. Computational 
screening was applied to evaluate the replacement of all exposed nonpolar residues on the Cy2 and 
Cy3 domains with all 20 amino acids. Variable positions were chosen by visual inspection of the 

10 1DN2 structure, and include exposed nonpolar residues which are not involved in binding an Fc 

receptor. For example Met252 and Met428 are involved in binding to FcRn (Martin et a/., 2001 , Mol. 
Cell 7:867-877), and Tyr296 and Tyr300 are close to the binding site for FcyRs (Sonderman et al, 
2001, J. Mol Biol 309:737-749). Therefore these residues, despite being exposed nonpolars, were 
not included as variable positions. Variable positions are shown in Figure 30 and listed in Figure 31a. 

1 5 Each of the 20 amino acids was considered at each variable position. 

The 1DN2 structure was used as the template for design calculations. For each variable position, 
each of the 20 amino acids was substituted and allowed to sample rotamer conformations derived 
from a backbone-independent rotamer library using a flexible rotamer model. A genetic algorithm was 

20 used to optimize the conformation of each amino acid substitution at each variable position, with 
energies being calculated during each round of evolution using a force field containing terms 
describing van der Waals, solvation, electrostatic, and hydrogen bond interactions. In this way, the 
lowest energy rotamer of each substitution was determined. This energy was defined as the energy of 
substitution for that amino acid at that variable position. Thus this design calculation provided an 

25 energy of substitution for each of the 20 amino acids at each variable position. Figure 31a shows 
these results. At each variable position, the lowest energy substitution and all amino acid 
substitutions which are within 1 unit of energy of the lowest energy substitution are shown. Thus 
Figure 31a presents the most favorable substitutions for each of the variable positions. 
These results can be used to generate one or more experimental libraries which can be subsequently 

30 screened for improved antibody solubility. An experimental library was derived from this 

computational screening output by including the WT amino acid and all favorable polar amino acid 
substitutions at each variable position. As can be seen, no polar substitutions are predicted to be 
favorable for position 404, and so this position was left as the nonpolar WT phenylalanine in the 
library. This experimental library, which has a combinatorial complexity of 4.9 x 10 8 , is shown in 

35 Figure 31b. 

Affinity Maturation 

As discussed above, a number of strategies can be applied for utilizing computational screening 
methodology to affinity mature antibodies. 

40 

Example 14: rhumAb VEGF Affinity Maturation Using The Antibody/Antigen Complex Structure 
The availability of the bound antibody/antigen structure for rhumAb VEGF enables the affinity of this 
antibody to be enhanced directly using computational screening. More favorable interactions between 
the rhumAb VEGF antibody and its antigen were designed. Variable positions involved in mediating 
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5 this Interaction were chosen by visual inspection of the 1CZ8 structure, shown in Figure 32 and listed 
in Figure 33a. The set of amino acids allowed at variable positions was also chosen by visual 
inspection. Antigen residues which contact variable residue positions were floated. The 
conformations of amino acids at variable and floated positions were represented as a set of side chain 
rotamers derived from a backbone-independent rotamer library. 

10 

The 1C28 structure was used as the template for design calculations. The energies of all possible 
combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 
interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 
15 ground state, and the WT rhumAb VEGF sequence, are shown in Figure 33a. A diversity of 

sequences for an experimental library was generated by using a Monte Carlo algorithm to evaluate 
the energies of 1000 similar sequences around the predicted ground state. Figure 33a shows the 
output sequence list from this Monte Carlo search. 

20 These results can be used to generate one or more experimental libraries which can be screened for 
enhanced affinity for antigen. An experimental library, shown in Figure 33b, was derived by applying 
a 5% cutoff of occupancy to the Monte Carlo output from each set of calculations, i.e. only amino acid 
substitutions which occur in 50 or greater variants out of the 1 000 Monte Carlo output sequences are 
included in the library. Additionally, the WT amino acids at heavy chain positions 31, 54, 57, and 59 

25 were added to the library so that the WT sequence is represented combinatorially in the library. This 
experimental library has a complexity of 2304. 

In another set of calculations, rhumAb VEGF was affinity matured by reengineering antibody residues 
which do not contact antigen. Here the variable positions in the design calculation were those 

30 residues which interact with contact residues, but are not themselves contact residues. As discussed 
above, by using computational screening to explore substitutions in the shell of residues which 
interact with contact residues, a quality diversity of new contact residue conformations can be 
sampled. Variable positions involved were chosen by visual inspection of the 1CZ8 structure, shown 
in Figure 34 and listed in Figure 35a. The set of amino acids allowed at variable positions was also 

35 chosen by visual inspection. The conformations of amino acids at variable positions were 

represented as a set of side chain rotamers derived from a backbone-independent rotamer library. 
The 1CZ8 structure was used as the template for design calculations. The energies of all possible 
combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 

40 interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 
ground state, and the WT rhumAb VEGF sequence, are shown in Figure 35a. A diversity of 
sequences for an experimental library was generated by using a Monte Carlo algorithm to evaluate 
the energies of 1000 similar sequences around the predicted ground state. Figure 35a shows the 
output sequence list from this Monte Carlo search. 
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These results can be used to generate one or more experimental libraries which can be screened for 
enhanced affinity for antigen. An experimental library, shown in Figure 35b t was derived by applying 
a 5% cutoff of occupancy to the Monte Carlo output from each set of calculations, I.e. only amino acid 
substitutions which occur in 50 or greater variants out of the 1000 Monte Carlo output sequences are 
10 included in the library. The WT is already represented in this library, and so no additional amino acids 
were added. This experimental library has a complexity of 784. 

Example 15: SM3 Affinity Maturation Using The Antibody/Antigen Complex Structure 

The availability of the bound antibody/antigen complex structure for SM3 enables the affinity of this 

15 antibody to be enhanced directly using computational screening. SM3 is a mouse antibody that is 
currently being developed as an anticancer agent. The high resolution structure is available of the 
complex of the SM3 Fab with its target antigen, a peptide from the cell surface mucin MUC1 . This 
structure, PDB accession code 1SM3, served as the template for design calculations. More favorable 
interactions between the SM3 antibody and its antigen were designed. SM3 binds the MUC1 peptide 

20 using an extensive binding pocket which involves a large number or SM3 residues. The pocket can, 
however, be separated into two somewhat independent sets of residues, and thus in order to reduce 
the complexity of the computational screen, two separate sets of design calculations were carried out. 
Variable positions involved in mediating this interaction were chosen by visual inspection of the 1SM3 
structure, shown In Figure 36 and listed in Figure 37a and 37b. The set of amino acids allowed at 

25 variable positions was also chosen by visual inspection. Antigen residues were kept fixed in the two 
calculations. The conformations of amino acids at variable positions were represented as a set of 
side chain rotamers derived from a backbone-independent rotamer library. 

The 1SM3 structure was used as the template for design calculations. For both sets of calculations, 
30 the energies of all possible combinations of the considered amino acids at the chosen variable 
positions were calculated using a force field containing terms describing van der Waals, solvation, 
electrostatic, and hydrogen bond interactions, and the optimal (ground state) sequences were 
determined using a DEE algorithm. These ground states, and the WT SM3 sequence, are shown in 
Figure 37a and 37b. A diversity of sequences for an experimental library was generated by using a 
35 Monte Carlo algorithm to evaluate the energies of 1000 similar sequences around the predicted 

ground states. Figure 37a and 37b show the output sequence lists from these Monte Carlo searches. 
These results can be used to generate one or more experimental libraries which can be subsequently 
screened for enhanced affinity for antigen. An experimental library, shown in Figure 37c, was derived 
by applying a 5% cutoff of occupancy to the Monte Carlo output from each set of calculations, and 
40 then these primary libraries were subsequently combined to generate a secondary library with 

mutations at all positions. Additionally, the WT amino acids at light chain positions 50, 53, 56, and 93, 
and heavy chain position 96 were added to the library so that the WT sequence is represented 
combinatorially in the library. This may be particularly important here because some glycine and 
proline residues in the WT sequence were allowed to be variable in the calculations. These amino 

71 



WO 03/074679 PCT/US03/06598 

5 acids can be important determinants of protein backbone conformation, and therefore the benefit of 
their replacement with side chains which are capable of making favorable interaction with antigen may 
be outweighed by unfavorable potential backbone movements. This combinatorial library has a 
complexity of 3.5 x 1 0 6 . 

10 Example 16: Campath Affinity Maturation Using The Antibody/Antigen Complex Structure 

The availability of the bound antibody/antigen complex structure for Campath enables the affinity of 
this antibody to be enhanced directly using computational screening. More favorable interactions 
between the Campath antibody and its antigen were designed. Variable positions involved in 
mediating this interaction were chosen by visual inspection of the 1CE1 structure, shown in Figure 38 

1 5 and listed in Figure 39a. The set of amino acids allowed at variable positions was also chosen 

subjectively by visual inspection. Antigen residues were floated. The conformations of amino acids at 
variable and floated positions were represented as a set of side chain rotamers derived from a 
backbone-independent rotamer library. 

20 The 1 CE1 structure was used as the template for design calculations. The energies of all possible 
combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 
interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 
ground state and the WT Campath sequence are shown in Figure 39a. A diversity of sequences for 

25 an experimental library was generated by using a Monte Carlo algorithm to evaluate the energies of 
1000 similar sequences around the predicted ground state. Figure 39a shows the output sequence 
list from this Monte Carlo search. 

These results can be used to generate one or more experimental libraries which can be screened for 
30 enhanced affinity for antigen. An experimental library, shown in Figure 39b, was derived by applying 
a 5% cutoff of occupancy to the Monte Carlo output from each set of calculations, i.e. only amino acid 
substitutions which occur in 50 or greater variants out of the 1000 Monte Carlo output sequences are 
included in the library. Additionally, the WT asparagine at light chain position 50 was added to the 
library so that the WT sequence is represented combinatorially in the library. This combinatorial 
35 library has a complexity of 486. 

Because of the number of residues involved in mediating the interaction of Campath with its antigen, it 
may be beneficial to reduce the complexity of the design calculations. The use of sequence 
information here will enable the complexity of the computational problem to be reduced while ensuring 
40 that the remaining diversity sampled is of high quality, in terms of the structural, functional, and 

immunogenic fidelity of the antibody. Sequence information was used to guide the choice of variable 
positions and the set of amino acids considered at those positions for the Campath affinity maturation 
calculations. Figures 40a and 40b show the Campath heavy and light chain variable chain sequences 
aligned with the human V H and V L kappa germ line sequences. A new design calculation using this 
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information was run to affinity mature Campath. The sequence information was first used to 
reevaluate the list of variable positions. A subset of the positions in Figure 39a was chosen based on 
the degree of variability at each position in the germ line. The sequence information was used to 
choose the set of amino acids considered at variable positions in the new design calculation. All 
amino acids, and only those amino acids, which appear at each variable position in the germ line were 
considered in the new design calculation. For variable positions in CDR3, for which no sequence 
information is available, all 20 amino acids were considered. This set of amino acids is shown in 
Figure 41a. Antigen residues were allowed to float during the calculations. 

The 1CE1 structure was used as the template for design calculations. In this new calculation, 
energies of all possible combinations were not precalculated. Instead, a genetic algorithm was used 
to screen for low energy sequences, with energies being calculated during each round of "evolution- 
only for those sequences being sampled. The conformations of amino acids at variable and floated 
positions were represented as a set of side chain rotamers derived from a backbone-independent 
rotamer library using a flexible rotamer model. Energies were calculated using a force field containing 
terms describing van der Waals, solvation, electrostatic, and hydrogen bond interactions. This 
calculation generated a list of 300 sequences which are predicted to be low in energy. Clustering was 
performed to facilitate analysis of the results and library generation. The 300 output sequences were 
clustered computationally into 10 groups of similar sequences using a nearest neighbor single linkage 
hierarchical clustering algorithm to assign sequences to related groups based on similarity scores 
(Diamond, R., Coordinate-Based Cluster Analysis, Acta Cryst. 1995 , D51, 127-135.). The 300 output 
sequences were clustered computationally into 10 groups of similar sequences. That is, all 
sequences within a group are most similar to ail other sequences within the same group and less 
similar to sequences in other groups. The lowest energy sequence from each of these ten clusters, 
used here as a representative of each group, is presented in Figure 41a. 

These results can be used to generate one or more experimental libraries which can be subsequently 
screened for increased affinity for antigen. An experimental library can be derived directly from the 
representative cluster group sequences. Thus Figure 41a provides a 10 sequence experimental 
library. To efficiently use experimental resources, this library size of 10 variants could be screened 
first, followed by subsequent screening of sequences or a subset of sequences within the group to 
which the experimentally determined most favorable variant belongs. For example, if variants 4 and 9 
(i.e. the lowest energy sequences from cluster groups 4 and 9) were found experimentally to be most 
favorable, all of the sequences of cluster groups 4 and 9 could be subsequently screened. The 6 
sequences in group 4 and 5 sequences in group 9 are presented in Figure 41b as an example of such 
an experimental library. 

Example 17: D3H44 Affinity Maturation Using Complex And Uncomplexed Structures 

The availability of structural information for both the bound and unbound forms of the anti-tissue factor 

antibody D3H44 provide the opportunity to explore how both complexed and uncomplexed structural 
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information can be used to computationally affinity mature an antibody. D3H44 is a humanized 
antibody that is cunently being developed for treatment of thrombotic disorders. The high resolution 
structure of the D3H44 antibody/antigen complex, PDB accession code 1JPT, and the unbound 
antibody structure, PDB accession code UPS, served as templates in separate sets of design 
calculations aimed at designing more favorable interactions between the D3H44 antibody and its 
antigen. Variable positions involved in mediating this interaction were chosen by visual inspection of 
the 1JPT structure, shown in Figure 42 and listed in Figure 43a. The set of amino acids considered at 
variable positions was also chosen by visual inspection. Antigen residues which contact antibody 
variable position residues were floated in the bound structure calculation. The conformations of 
amino acids at variable and floated positions were represented as a set of side chain rotamers 
derived from a backbone-Independent rotamer library. 

The 1JPT and UPS structures were used as templates in two separate sets of design calculations. 
For both sets of calculations, the energies of all possible combinations of the considered amino acids 
at the chosen variable positions were calculated using a force field containing terms describing van 
der Waals. solvation, electrostatic, and hydrogen bond interactions, and the optimal (ground state) 
sequences were determined using a DEE algorithm. These ground states, and the WT D3H44 
sequence, are shown in Figures 43a and 43b. A diversity of sequences for an experimental library 
was generated by using a Monte Carlo algorithm to evaluate the energies of 1000 similar sequences 
around the predicted ground states. Figures 43a and 43b show the output sequence lists from these 
Monte Carlo searches. 

Notably, the diversity of sequences in the bound output is approximately a subset of the sequences in 
the unbound output. This result validates the use of using unbound structural information for affinity 
maturation, because it indicates that such calculations, while reducing sequence complexity for 
experimental screening, still produce quality antigen binding diversity. That is, experimental libraries 
derived from such calculations are enriched in sequences that favorably bind antigen. For example, 
experimental libraries were generated from the output of both bound and unbound calculations. 
These experimental libraries, shown in Figure 43c, were derived by applying a 1% cutoff of occupancy 
to the Monte Carlo output from each set of calculations, i.e. only amino acid substitutions which occur 
in 10 or greater variants out of the 1000 Monte Carlo output sequences are included in the library. 
Additionally. WT amino acids were incorporated into the library if they were not already represented. 
The combinatorial complexities are 1296 and 211680 for the bound- and unbound-derived libraries 
respectively. As can be seen, a significant portion of the sequences present in the bound-derived 
library are present in the unbound-derived library, which is substantially reduced in complexity from 
random sequences. 

The results from both sets of calculations can be combined to generate an experimental library. An 
experimental library, shown in Figure 43d, was derived by including only those substitutions which are 
present in the Monte Carlo outputs of both bound and unbound design calculations. Additionally, the 
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5 WT amino acid at light chain position 94 was added to the library so all of the WT amino acids are 
represented. This library provides a list of substitutions that are compatible with the antibody in both 
forms, ensuring that the derived library does not contain variants that are poorly behaved in the 
absence of antigen. Furthermore, substitutions which are favorable in the bound form but unfavorable 
in the unbound form may be due to the need for significant conformational changes for binding. 

10 Elimination of these substitutions may trim the library of unfavorable variants which lose entropy upon 
binding. This combinatorial library has a complexity of 864. 

Example 18: Herceptin Affinity Maturation Using The Uncomplexed Structure 

Although there Is a structure available of the unbound Herceptin scFv antibody fragment, there is no 

1 5 available structure of the bound antibody/antigen complex. However, there is a wealth of 

experimental information available which can be used to guide affinity maturation design calculations. 
An alanine scanning mutagenesis study (Kelley et a/., 1993, Biochemistry 32:6828-6835) showed that 
there are four central Herceptin residues, W, X. Y, and Z which are crucial for binding the Her2/neu 
antigen. A subsequent study used phage display to explore sequence diversity at these residues and 

20 residues proximal to them in the 1 FVC structure (Gerstner et a/., 2002, J. Mol. Biol. 321 :851-862). 
The results from these studies were used to guide the choice of variable positions and amino acids 
considered at those positions in design calculations aimed at affinity maturing the Herceptin antibody. 

« 

Here the goal is to utilize computational screening to generate a high quality library that is enriched for 
substitutions at antigen binding positions which are structurally compatible with the Herceptin 

25 antibody. Variable positions were chosen as those positions which show moderate variability in the 
phage display results. That is, positions that were very intolerant to mutation (one amino acid identity 
was observed in the majority of selected sequences), and positions that were very tolerant to mutation 
(no preference for amino acid identity was observed) were not chosen as variable positions. 
Mutations at these positions are expected to have a deleterious effect or no effect respectively on 

30 antigen binding. Positions that have some but not stringent amino acid requirements have the most 
value in terms of exploring diversity which may be more favorable for antigen binding. These 
positions are shown in Figure 44 and listed in Figure 45a. The set of amino acids considered at these 
variable positions was also guided by the experimental results. For a given position, if the diversity of 
substitutions observed was greater than 90% polar or nonpolar residues, the amino acids considered 

35 for that position were chosen as the set belonging to the surface or core classification respectively. If 
no trend was observed, the amino acids considered for that position were chosen as the set belonging 
to the boundary classification. The conformations of amino acids at variable positions were 
represented as a set of side chain rotamers derived from a backbone-independent rotamer library. 
The 1FVC structure was used as the template for design calculations. The energies of all possible 

40 combinations of the considered amino acids at the chosen variable positions were calculated using a 
force field containing terms describing van der Waals, solvation, electrostatic, and hydrogen bond 
interactions, and the optimal (ground state) sequence was determined using a DEE algorithm. This 
ground state, and the WT Herceptin sequence, are shown in Figure 45a.. A diversity of sequences for 
an experimental library was generated by using a Monte Carlo algorithm to evaluate the energies of 
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5 1000 similar sequences around the predicted ground state. Figure 45a. shows the output sequence 
list from this Monte Carlo search. 

These results can be used to generate one or more experimental libraries which can be screened for 
enhanced affinity for antigen. An experimental library, shown in Figure 45b, was derived by applying 
10 a 1% cutoff of occupancy to the Monte Carlo output from each set of calculations, i.e. only amino acid 
substitutions which occur in 10 or greater variants out of the 1000 Monte Carlo output sequences are 
included in the library. Additionally, the WT amino acids at light chain positions 53 and 91 , and heavy 
chain positions 59 were added to the library so that the WT sequence is represented combinatorially 
in the library. This experimental library has a complexity of 16800. 

15 

All references cited herein are incorporated by reference in their entirety. 

Whereas particular embodiments of the invention have been described above for purposes of 
20 illustration, it will be appreciated by those skilled in the art that numerous variations of the details may 
be made without departing from the invention as described In the appended claims. 
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CLAIMS 

claim: 

1 . A method for optimizing at least one physico-chemical property of an antibody, said method 
executed by a computer under the control of a program, said computer including a memory 
for storing said program, said method comprising the steps of: 

a. receiving a template antibody structure; 

b. selecting at least one variable positions which belong to said template antibody 
structure; 

c. selecting at least one amino acids to be considered at said variable positions; 

d. analyzing the interaction of each of said amino acids at each variable position with at 
least part of the remainder of said antibody, including said amino acids at other 
variable positions; and 

e. identifying a set of at least one antibody sequence with at least one optimized 

physico-chemical property. 

2. A method according to claim 1 , wherein at least one of the optimized physico-chemical 
properties is selected from the group consisting of stability, solubility, and antigen binding 
affinity. 

3. A method according to claim 2. wherein at least one of the optimized physico-chemical 
properties is stability. 

Arr A method according to claim 3, wherein the stabilized portion of said antibody is selected from 
the group consisting of a domain and an interface between domains. 

5. A method according to claim 4, wherein the stabilized portion of said antibody is a domain. 

6. A method according to claim 4, wherein the stabilized portion of said antibody is an interface 
between domains. 

7. A method according to claim 2, wherein the physico-chemical property is solubility. 

8. A method according to claim 7, wherein at least one antibody sequence possesses an 
increase in polar character. 

9. A method according to claim 7, wherein said selecting step further comprises selecting at 
least one nonpolar amino acid and substituting said nonpolar amino acid with a polar amino 

acid. 
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10. A method according to claim 7, wherein said selecting step further comprises altering the pi of 
the antibody. 

11. a method according to claim 2, wherein at least one of the optimized physico-chemical 
properties is antigen binding affinity. 

12. A method according to claim 1 1 , wherein at least one of said variable positions is located in a 
framework region of the antibody. 

13. A method according to claim 1 1 , wherein at least one of said variable positions is located in a 
complementarity determining region (CDR) of the antibody. 

14. A method according to claim 1 , wherein each of said amino acids at each of said variable 
positions are represented as a group of potential rotamers. 

15. A method according to claim 1 , wherein at least two variable positions are selected and at 
least two amino acids are considered at each variable position. 

16. A method according to claim 1 , wherein said analyzing step further comprises a 
computational step utilizing at least two of the energy terms selected from the group consisting of van 
der Waals, electrostatics, hydrogen bonds and solvation. 

17. A method according to claim 1 , wherein said variable positions are chosen based on their level 
of variability in a set of aligned antibody sequences. 

18. A method according to claim 1 , wherein one said amino acids are chosen from a list of amino 
acids which occur at said position or positions in a set of aligned antibody sequences. 

19. A method according to claim 1, wherein said analyzing step includes a Protein Design 
Automation program. 
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21 . A method according to claim 1 , wherein said antibody is selected from the group consisting of 
a full-length antibody and an antibody fragment. 

10 

22. A method according to claim 1 , wherein said antibody sequence is substantially encoded by at 
least one mammalian antibody gene. 

23. A method according to claim 1 , wherein said antibody is selected from the group consisting of 
15 a fully human antibody, a humanized antibody, a chimeric antibody, and an engineered antibody. 

24. A method according to claim 1 , further comprising f) generating a library from said set of at 
least one antibody sequence. 

20 25. A method according to claim 24 wherein said library is a computational library. 

26. A method according to claim 24 wherein said library is generated experimentally. 

27. A method according to claim 24 further comprising g) experimentally screening said library. 

25 

28. A method according to claim 27, wherein said library is screened using at least one selection 
method. 

29. A method according to claim 25, wherein said library is screened using at least one selection 
30 method selected from the group consisting of: phage display methods, cell surface display, in vitro 

display, and cytometric screening. 

30. A method according to claim 25, wherein said selection method is a directed evolution method. 
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32. An antibody sequence according to claim 28, wherein said antibody sequence is substantially 
encoded by a mammalian antibody gene. 

33. An antibody identified from said screening of claim 24. 

34. An antibody to claim according to claim 33, wherein said antibody Is a full-length antibody or an 
antibody fragment. 

35. An antibody according to claim 33, wherein said antibody is selected from the group consisting 
of a fully human antibody, a humanized antibody, a chimeric antibody, and an engineered antibody. 

36. A method of treating a patient in need of said treatment, comprising administering an antibody 
of claim 28 to said patient. 



80 



WO 03/074679 



PCT/US03/06598 



Figure 1 
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Figure 9a 
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Figure 45a 
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I. Introduction 

A molecular explanation of antibody effector function requires the 
description of multiple antibody molecules cross-linking an array ot 
antigen molecules to multiple effector molecules. The antigen mole- 
cules are likely to be on a cell surface and the effector molecules are 
either large, as for complement, or also on a cell surface, as tor *c 
receptors. Therefore this is a complex problem. 

As regard antibodies, we have crystal structures for Fab fragments, 
for Fab fragments complexed with a number of antigens (reviewed in 
Davies et al., 1990), and for Fc from IgG (Deisenhofer, 1981; Sutton 
and Phillips, 1983). We also have the low-resolution structures ot two 
mutant whole IgG molecules (Silverton et al, 1977; Sarma ^and 
Laudin, 1982; Rajan et al, 1983). These mutant molecules lack the 
hinge region and generally show poor effector activity (Burton, 1985). 
They crystallize and show a complete diffraction pattern probably 
because the loss of the hinge has reduced their flexibility Native IgG 
molecules are flexible and do not give diffraction from the Fc part ot 
the molecule (Huber et al, 1976; Ely et aZ.,1978). Therefore, our best 
picture of the whole IgG molecule is probably built up by combining 
the crystal data on the fragments with techniques such as electron 
microscopy (EM) and solution studies giving information on whole 
antibody conformation and flexibility. For the other antibody classes 
we have less data and generally have to make some fairly broad ex- 
trapolations from IgG in order to have a working model. A small num- 
ber of studies provide us with some ideas as regards the arrangement ot 
antibodies in arrays such as might be found triggering effector systems. 

No effector molecule has been crystallized. However the gross 
shape of the complement Clq as a bunch of tulips is well known, and 
the dimensions of some of the other complement components interact- 
ing with antibody are emerging. Many of the Fc receptors have now 
been cloned and sequenced and shown to belong to the immunoglobu- 
lin supergene family. Coupled with recent data on the sites on anti- 
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Fig. 1. The structure ofFc. The structure was solved for the Fc fragment from pooled 
human IgG (Deisenhofer, 1981); a-carbon positions; O, approximate centers of carbo- 
hydrate hexose units. 

body molecules interacting with effector molecules, one can therefore 
place some useful constraints on how the three molecules, i.e., anti- 
gen, antibody, and effector, could be arranged in space relative to one 
another. Such arrangements then need to be placed into the context of 
arrays, and here we are largely at the stage of suggesting models for 
experimental investigation. 

In this review we shall consider effector functions in turn, concen- 
trating first on IgG, for which molecular information is most detailed, 
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FlG. 3. The structure oflgE. A stereodrawing of the a-carbon trace of the revised model 
of Fc a from Helm et ol. (1991) based on sequence homology with Fc of IgC (Deisenhofer, 1981) 
and taking into account the parallel nature of the inter-c chain disulfide bonds. The C,2 domains 
(red) are at the top, the C,3 (green) are in the middle, and the C,4 (magenta) are at the bottom. 
Carbohydrate side chains (yellow) as found in IgC^ (Deisenhofer, 1981) are drawn attached to 
Asn 394 between the C«3 domains. The intradomain and interchain cysteines (Cys 241 and 
Cys 328) are indicated by large circles (yellow). The white segment on the left €-chain highlights 
the location of an Fc,Rl-binding peptide (residues 301-367). 
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Fic. 4. The C H 2 domain N-Hnked carbohydrate of IgG. As shown, four types of 
mannosyl chitobiose cores are found (± bisecting N-acetylglucosamine/± fucose) and 
other chain variants include the presence or absence of galactose and sialic acid. 



tunicamycin or by engineering Asn 297, the carbohydrate attachment 
residue, to another amino acid. The question then arises as to what 
effect loss of carbohydrate has on Fc structure. Proton nuclear mag- 
netic resonance (NMR) (Matsuda et al., 1990) comparing native and 
recombinant Fc indicates that the overall structures are quite similar; 
there is no evidence for example of inward "collapse" of the two C y 2 
domains. However, some differences are monitored, most notably by a 
reporter group (His 268) on a loop in spatial proximity to Asn 297. 
Small differences are also sensed by His 433 and His 435 (Gy3 domain) 
at the junction of the C y 2 and C y 3 domains and remote from the 
glycosylation site. One caveat here is that the NMR studies refer to Fc 
fragments and the presence of Fab arms could affect an aglycosylated 
Fc. 

Finally, the protein in Fig. 1 begins at resiclue 238 whereas Fc is 
generated by papain cleavage N-terminal to Cys 226. The intervening 
residues are not defined in electron density maps of human Fc presum- 
ably because of flexibility. This region is probably better considered as 
part of the hinge (the "lower hinge"; see below) even though it is 
coded for in the C v 2 exon. 

B. Conformation and Flexibility of IgG: The Hinge 

Both immunoglobulin flexibility and the conformation of anti- 
bodies have been recently reviewed (Nezlin, 1990; Burton, 1990a,b; 
Schumaker et al., 1991). Again only major points will be summarized. 
Flexibility of antibodies is classically equated with Fab arm flexibility 
(Y to T shape changes), which should allow "variable reach" for the 
antibody and therefore bivalent recognition of differently spaced anti- 
gens. However, the IgG molecule has available a number of modes of 
flexibility as illustrated in Fig. 5. It would seem very likely, although it 
is not formally demonstrated at this stage, that this flexibility would be 
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Genetic binge 



Fic. 6. Comparison of hinge sequences of IgC. In human IgC! the hinge exon 
corresponds to residues Cly 216 to Pro 230 (Eu numbering) (Kabat et ai, 1987). Cys 220 
forms a disulfide bridge to the light chain, and so residues 216-220 belong structurally to 
the Fab rather than to the hinge. This also applies to mouse IgG!, rabbit IgC, rat IgGi, 
and rat IgG^. The designation of these residues in the other isotypes is less clear. For the 
sake of consistency we refer to the upper hinge as the sequence between position 220 
and the first interheavy disulfide cysteine. The middle or core hinge contains a variable 
number of cysteines involved in interheavy chain disulfide linkages. The repeat unit 
(r.u.) in human IgC 3 corresponds to the sequence EPKSCDTPPPCPRCP. Crystallogra- 
phy indicates that Pro 238 in human lgCi is the first residue forming part of the folded 
C H 2 domain so that residues Ala 231 to Gly 237 (and equivalent residues in other 
isotypes) are assigned to the lower hinge. Alignments are after Burton (1985, 1987) and 
Feinstein et al (1986). 



relation between the upper hinge length and segmental flexibility, 
believed to be Fab arm motion (Oi et al, 1984; Dangl et al, 1988). In 
addition, protein engineering studies (Schneider et ai, 1988) have 
suggested that specific interactions between the hinge and the C y l 
domain can affect this segmental flexibility. Molecular mechanics 
methods have been applied to model the hinge-C y l domain interface 
in human IgG 4 and used to suggest that the interface may have a 
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F,c. 7. Schematic representation of the conformations of the human IgC subcl ««. 
These average solution conformations are deduced on the basis of ^H-angle X :»y 
IcattenngTnd sedimentation data (K. C. Davis and D. R. Burton, ^P ub, ; s h he p d ^ 
views oflgGo are shown, the bent shape being very similar to unat seen .n M 
(Ryazantsev et al, 1989). A tripod-like bent shape has been described from 
TcZScVoSaphs of human IgG, (Ryazantsev et al, 1990). IgC 2 and !gG 4 show the 
closest approach of Fab and Fc in the average conformations above. 



different conformation in IgG* than in the other human IgG subclasses 
(Snow and Amzel, 1988). Models for the solution conformations of the 
human IgG subclasses are shown in Fig. 7. 

C. Other Antibody Classes 

No crystal structures are available either for whole immunoglobu- 
HnVor for Fc foments other than those for IgG. Therefore, models for 
the 5S lsses g have been suggested on the basis of sequence pomol- 
ogy with IgG together with EM and solution studies. These are pre 
sented in Fig. 8. 

111. Complement Activation by Antibodies 

The complement cascade can be usefully divided into two phases 
(Howard and Hughes-Jones, 1988): (1) the deposition of X3 ^onttM 
antigenic cell surface, thereby facilitating phagocytos.s-m the class, 
cal pathway this requires the sequential activation of CI and C42, 
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Secretory 



igA 
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F.c. 8. Schematic representations of the structures of the five antibody classes. 
Heavy chains are shown blank and stippled, light chains are dark. The N-linked carbohy- 
drate structures are represented as Y shapes and O-linked carbohydrates are shown .as 
wavy lines. The representation for IgC can be compared directly with Fig. 2. The other 
structures are suggested on the basis of sequence comparison with extopotahop 
from known features of IgC structure, and physical studies (Burton 1987 1990b). IgM 
monomer has a pair of C„2 domains replacing the hinge, unpaired C 3 domains, and 
C-terminal tailpieces. The pentamer incorporates a molecule of J chain, which may 
adopt an Ig domain structure. Monomeric semm IgA, shown has an extended hinge with 
10 O-linked carbohydrate chains, an unusual positioning of the C H 2 domain carbohy- 
drates that makes the unpaired configuration of C„2 domains speculative, and C-terminal 
tailpieces. IgA 2 has a much shorter hinge, with the light chains generally disulfide linked 
to one another rather than to the heavy chain. Secretory IgA is a dirner in which 
monomers are disulfide linked via J chain and the arrangement stabilized by secretory 
component, a structure of five lg-like domains. IgD has a very extended hinge divided 
into a region rich in O-linked carbohydrate and a highly charged region, possibly in 
helical conformation, and short tailpieces. IgE closely resembles IgM monomer: 



(2) the formation of the membrane attack complex resulting in cell 
ly S j s _this requires the activation of C5 and attachment of C6-9. 

Antibody is involved in the first phase. Attachment of CI to antibody 
in an appropriate arrangement, typically spatially associated by reac- 
tion with antigen, leads to CI activation. CI, a giant complex of the Clq 
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hexamer and the Clr 2 Cls 2 tetramer, binds via Clq to antibody, result, 
na irTcir then Cls activation. Activated Cls (Cls) then cleave C4 to 
^^iSo-i* bond, which can now interact J 
Z cM surface or the proximal antibody molecules. Activated and 
«choi dS then accepts a C2 molecule, which is also cleaved by Cls. 
¥L? m complex is now able to cleave C3, again to reveal a labile 
tester Sf which rapidly becomes covalendy attached to groups 

^ Sm^Etenn. we should like to understand the binding of Clq 
to *e binding of Clq to antibody arrays, 

£c^s leading to CI activation, and the arrangement of the mole- 
cules involved in C3 deposition. 

A. The Interaction of Clq and Antibody 

1. Complement Clq 

Cla is a hexavalent molecule with a molecular weight of approxi- 
mately 460,000 and a structure likened to a bouquet of tulips » which 
^collagenous base and arm regions (stalks are connect Mo « 
globular head regions (Reid and Porter 198 ; Re d ^Coopjj 
1985; Kilchherr etnl., 1985; Schumaker et al, 1987) (Figs 10 14) i ne 
basic structure is conserved, for example, between man and bullfrog 
(Alexander and Steiner, 1980), and there is ^™^«*™* 
and IgG from different species to interact (Burton, ^The ab to 
eenerate isolated Clq heads by controlled collagenase digestion al 
fows itc btding to be localized to the heads. EM studies indicate 
con idlrable flexibility in the Clq molecule variable base-arm an- 
gle), which could have importance in recognizing an W ^ . but 
this flexibility is reduced in the physiological l.gand CI (Poon et al., 
1983; Schumaker et al, 1987). 

2. Thermodynamics of the Interaction 
One of the early problems in this field was why complement was 
triggered by anugen complexed antibody but not by ree anybody as 
occurs for example, in high concentration in serum. It wa^ suggested 
that the binding of antigen might transmit conformational change to 
the Fc region, which would enhance Clq binding and lead to wtiva 
tion. The thermodynamics of the interaction give no support for such a 
notion for complement triggering by IgG. affBTe a at ed 
Isolated Clq heads bind to normal pooled human IgG Jimg 
form with a binding constant of the order (1-5) x 10 M (Hughes 
Jones and Gardner, 1979). This value is also observed for ^ binding 
of Clq to monomeric pooled human IgG (Sledge and Bing, 1973. 
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Hughes-Jones, 1977; Muller-Eberhard, 1975). Only when both of the 
interacting partners are capable of multivalent interaction does the 
affinity increase to the order of 10 8 M" 1 (Burton, 1985). Detailed mod- 
. eling studies show that these differences in affinity, founded on multi- 
valent versus monovalent interaction, are sufficient to explain the 
binding of Clq to antibody associated on a cell surface even in the 
presence of physiological monomeric IgG (Dower and Segal, 1981). 

Other arguments favoring the associative versus the allosteric model 
of IgG complement triggering have been discussed elsewhere (Metz- 
ger, 1978; Burton, 1985). Perhaps one of the greatest problems for an 
allosteric model is provided by the complement-activating ability of 
the human IgG3 molecule, which has a hinge length of the order of 
15 nm including 21 proline residues and 11 interheavy chain disulfide 
bonds. It is not easy to conceive of the transmission of a common 
conformational change through this structure on the occupation of the 
antibody combining site by diverse antigen. 

The binding of the human IgG subclasses in monomeric form to 
human Clq has been investigated by ul trace ntrifuge studies (Schu- 
maker et al\, 1976). Assuming two IgG sites per Clq head, which 
gives the best fit to the experimental data, the rank order IgGa 
(K a = 2.9 x 10 4 M' x ) > Igd (1.2 x 10 4 ) > IgC 2 (0.6 x 10 4 ) > IgC< 
(0.4 x 10 4 ) was found. There was considerable error in the last value. 
The current consensus implies that there is no appreciable affinity of 
IgG 4 in associated form, for Clq (Carred et al. t 1989; Horgan et al, 
1990; Tan et al, 1990). 

In contrast to IgG, IgM occurs in its native form in an associated 
state, primarily a pentamer (Fig. 8). Uncomplexed, the pentamer ex- 
presses a single Clq-binding site with an affinity estimated as 5 x 10 5 
M' 1 (Feinstein et al, 1983) or as 10 4 M~ l (Poon et al, 1985). Com- 
plexed with antigen the affinity of the Clq-IgM interaction increases 
to of the order of 5 x 10 7 M ~ l . This increase arises in antibody excess 
(IgM binding multivalently to the same molecule expressing multiple 
epitopes) or in antigen excess (IgM cross-linking different antigen 
molecules). However, only the former case leads to CI activation 
(Feinstein et al, 1983). Therefore, it is suggested that the functionally 
important increase in affinity arises from the exposure of new Clq- 
• binding sites on a single IgM molecule on complexation rather than 
the spatial association of monovalent pentamers. The ability of single, 
complexed IgM molecules to activate CI supports this view (Borsos 
and Rapp, 1965a,b; Ishizaka et al, 1968; Feinstein et al, 1983). Simi- 
larly, on a cell surface it is found that complement activation only 
occurs when more than one antigen-binding site in the IgM molecule 
is occupied (Borsos et al t 1981). 
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i evs unlated bv proteolytic removal of the F(ab')2 
Interestingly Fc5. isolated Dy proreu y unC omplexed 

units, shows the same binding affinity ']« C ^»*£ ^ J KSu]t 
IgM. Henee the creation of new Clq sites ' «^ ^Jj^ sites 
of antigen binding '^^^^^^^ of conforma- 
S3 FS Just rco^functiona, IgM binding to anti- 

gen (see below). 

3 Clq-lgC Interaelion ol the Molecular Leoe! 
A numbe, of studies have indiee.ed Use ^"^.^ 

gives indication that three charged residues, Glu 318 Lys 320, and >ys. 
322 constitute the essential binding motif. This motif is part of a site 
Ser propoXd (Burton et al., 1980). Figure 2 shows the l^"*™ 
o? the motif and Fig. 9 shows the mutations that were made ,n the 

is fhe entire binding region, and, if so, any ^^^^Z 
mnti r rhp ahi i itv 0 f a peptide mimic of the motif to inhibit tne 

involvement of three charged groups if interacting with ^three mn.Iar 
eroups on Clq would imply the release of six ions per Clq head bound. 
SeTbservation of 12 ions released 

two heads bound per Clq molecule to an immune *V^>™£& 
in agreement with the thermodynamics Other ™ ™™ 

implied that a positive charge is required at ^* 0 ™™J>£*^ T 
substitute for Lys) and probably a hydrogen bond at pos Uon 3 8 CThr 
can substitute for Glu). Position 320 will ^\ e ^Vtll coZ^- 
retention of Clq binding, but the latter mutation abrogate s compto 
ment activation. This underscores the existence of requ rement add. 
tional to Clq binding for effective complement activation. It should 
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also be noted here that experiments with IgG molecules with heterolo- 
gous heavy chains have been used to suggest that only one heavy chain 
is essential for Clq binding (Clark et al, 1989a). 

Other studies are suggestive of some importance for the context of 
the motif. A substitution of Gin by Glu at position 324 m mouse IgG to 
adjacent to the motif abrogates Clq binding and complement activa- 
HonTNose et Q l, 1988; Nose and Leanderson, 1989). The behavior of 
different IgG isotypes also suggests the importance of context. The 
Jumln gG subclasses all possess the motif and yet IgG 4 does no 
Sficandy bind Clq. The favored interpretation of this has been that 
S Fab arms of lgG 4 obstruct the Clq-binding site because of the 

-restricted" hinge of IgG 4 (Burton, ^^Z^ZTlZ^ an 
are the description of Fc4 (Fc fragment of lgC 4 ) binding CI with an 
affinity comparable to that for Fcl and, in aggregated form, activating 
complement (Isenman et al, 1975). The inability «f hmge^dekted 
human IgGi mutants to bind CI or activate complement seem to fur- 
ther provide examples of just such Fab arm obstruction (Klein et al, 
1981) New protein engineering studies (Tan et al, 1990; Norderhaug 
et al 1991) provide a challenge to this interpretation of IgC 4 function. 
In particular, an IgG 4 molecule wherein the hinge exon of lgG 4 is 
replaced with those found in IgG 3 fails to bind Clq or activate comple- 
ment. An IgC 3 molecule with the genetic hinge of IgC 4 still binds Clq 
effectively (Tan et al, 1990). IgG 3 molecules with an IgC 4 hinge or an 
IgG 4 hinge and CHI domain activate complement more eflectively 
than wild type IgG 3 (Norderhaug et al, 1991). A 

Therefore, it would appear that there is a structural lesion in Fc4that 
leads to its inability to bind Clq. The data of Isenman et al would 
require that this lesion is sensitive to the presence of the Fab arms. 1 he 



F, C . 9. Mutations made in a protein engineering strategy to localize the Clq-b.nd ng 
site on IgG (after Duncan and Winter, 1988). The diagram shows an a-carbon 
C H 2 domain from human Fc (Deisenhofer. 1981). (A) Groups of res.dues , ^>"»"" 
proposed as Clq-binding sites. A. Residues 274-281 (Boackle et al 1979); B, residues 
282-292 (Lukas et al, 1981); C, residues 290-295 (Brunhouse and Cebra. 1979). O 
residue 318. 320, 322. 331. 333, 335. and 337 (Burton et «.!.. 1980). (B) Residues ,n wh*h 
exposed side chains were altered by site-directed mutagenesis in ^ "ornologous mouse 
lgC Ib antibody. Residues altered to Ala were S239. K248. 1253. D265 S2 67 ,D2 70 Q274 
E283. H285. Q290, E294. N297. K317. E318, K320. K322, K326. E333 T335. S337 and 
K340. P331 was mutated to Gly and E235 (not shown) was mutated to Leu. Open circles 
show mutants that were still lytic; closed circles show muUnts that were L^'"*^ 
suggested to comprise the Clq-binding site. i.e.. E318. K320, and K322. The carbohy- 
drate attachment residue, N297. is shown shaded. 
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positions in the CH2 domain distinguishing IgG 4 from the other sub- 
classes have been discussed previously (Burton, 1985; Jefferis, 1986). 
The sequence Ser330-Ser331 (Ala-Pro in the other subclasses, Fig. 1) 
in close proximity to the motif stands out but there are other differ- 
ences, such as Phe 234 (Leu) and Gin 268 (His). The ability of an 
IgG 4 /IgGi switch mutant (CHI to residue 291 of CH2 from lgG 4 ; 
residue 292 to end of CH2 and CH3 domain from IgGi) to activate 
complement implies that residues 292-340 in the CH2 domain contain 
the amino acids responsible for the inability of IgG 4 to activate (Tao et 
al, 1991). 

The case of human IgG 4 may have implications for the Clq-binding 
and complement-activating patterns of all the isotypes. Basically there 
have been two schools of thought. Both saw the Clq-binding site as 
being present on most isotypes but modulated by the Fab arms. The 
hinge, and in particular, the upper hinge, was seen as crucial. The first 
school tended to emphasise the necessity for flexibility in this upper 
hinge in binding Clq and pointed to a correlation between Clq bind- 
ing and segmental flexibility (Oi et al, 1984; Dangl et al, 1988). The 
second school tended to associate "restricted" hinges, which could 
arise from shorter upper hinges, with structural accommodations plac- 
ing Fab arms closer to Fc (Burton, 1985). It pointed to solution data 
indicating more compact conformations for isotypes such as human 
IgG 2 and IgG 4 with their inferior Clq-binding ability compared to the 
more extended IgGi and IgG 3 (Gregory et al, 1987). The data of Tan et 
al. show that upper hinge flexibility per se (at least in the nanosecond 
time range) is not necessary for Clq binding. Thus, for example, two 
"rigid" mutant IgG 3 molecules are able to bind Clq as effectively as 
the "flexible" wild-type IgG 3 molecule. They also show that close 
approach of the Fab arms is probably not the reason for the failure of 
IgG 4 to bind Clq unless CH2 folded domain sequences were some- 
how controlling hinge conformation and therefore Fab arm dispo- 
sition. 

What is the origin of the poor Clq binding of isotypes such as mouse 
IgGi and guinea pig IgGi and the loss of Clq binding associated with 
hinge deletion (Klein et al, 1981; Michaelsen et al, 1990; Tan et al, 
1990)? Fab arm obstruction is an appealing explanation brought into 
question by the human IgG 4 results. Other, unidentified structural 
lesions are a possibility. In particular, hinge deletion may subtly per- 
turb conformation. Interestingly, for mouse IgGi, replacement of the 
C 7 2 domain with that of mouse IgG 2 b generates Clq binding and 
complement lysis at a level comparable to that for IgG 2 b (Clackson and 
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Winter 1989). This again militates against the importance of the upper 
hinge (hinge exon) but leaves open the role of the lower hinge (C,2 
domain exon). Mouse IgC, has a restricted hinge in tenns not only of a 
short upper hinge but also of a short lower hinge where it lacks the 
characteristic Gly-Gly sequence. It should be noted however, that 
introduction of the mouse IgG 2b lower hinge into IgG, does not gener- 
ate a lytic antibody (T. Clackson, personal communication). 

The exceptionally long hinge of IgG 3 is something of an enigma^ 
Two groups have now reported that most of the hinge can be deleted 
without any detrimental effects on Clq binding or complement activa- 
tion (Sandlie et al, 1989; Michaelsen et al, 1990; Tan et al 1990). In 
fact, an IgG 3 with a single hinge exon is more effective at Clq binding 
and'a molecule with an extra exon is less effective. 

Aglycosylation is a context to which Clq binding is sensitive to 
varying degrees. Aglycosylation of mouse IgC^ or mouse IgG 2b pro- 
duces only a small (roughly threefold) reduction in Clq binding affin- 
ity (Leatherbarrow et al, 1985; Duncan and Winter, 1988). Agam a 
small reduction (twofold) has been reported for human Fcl (Matsuda 
et al 1990). In contrast, complete abolition of Clq binding has been 
reported for aglycosylated human IgG, and a dramatic decrease tor 
IgG 3 (Morrison et al, 1989). Even for those cases where Clq binding is 
minimally affected, whole complement activation, where looked at, is 

abolished. ' , , , 

Recombinant hybrid molecules in which Fc is linked to other 
proteins afford an opportunity to look at the effect of context on et- 
fector function. Two groups have reported on recombinant CD4- 
immunoglobulin molecules (Capon et al, 1989; Traunecker et al, 
1989- Byrn et al, 1990). Replacing the V H domain of human yl chain 
by the first two or all four of the CD4 cytoplasmic domains gives 
molecules expressed as dimers in a eukaryotic cell line in the absence 
of light chains. Both molecules bind gpl20 and FcRI, but neither binds 

Clq. However, a molecule in which both V H and C y l domains ot 
mouse IgC 2a are replaced by the first two domains of CD4 does bind 
Clq. It would seem that the presence of the C y l domain, perhaps 
unable to pair with the C L domain, is detrimental to Clq binding. In 
this vein, attempts to graft Clq-binding sites into different molecules 
will undoubtedly be revealing as regards to site requirements. 

Considering the interaction site on Clq, there are charged regions 
on each of three chains in the heads, e.g., the sequence Glu 198-X- 
Asp 200-Lys 202 on the A chain, which might interact with the 
charged motif on IgG (Reid et al, 1982). 
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4. Clq-IgM Interaction at the Molecular Level 

As for IgG, it appears that charged groups are important in the 
interaction of IgM and Clq (Hughes-Jones and Gardner, 1978; Poon et 
al. t 1985). It is estimated that 12 ions are released into solution on the 
binding of Clq to uncomplexed IgM in solution (V. N. Schumaker, 
personal communication) and 8-9 ions are released on the binding to 
IgM interacting with a cell surface. Given the differences in binding 
affinities described earlier, the Clq-binding site on IgM is not ex- 
pected to be identical to that of IgG. Indeed, the IgG motif is not found 
on IgM. 

The Clq-binding site(s) are located on the Fc pentamer (Fig. 8) but 
the precise domain (C M 3 or C^4) has been debated. An interesting 
mutant IgM molecule with a single amino acid change (Pro — > Ser 436) 
in the C^3 domain has been isolated and shown to have decreased 
affinity for Clq (Wright et al., 1988). This has been taken as evidence to 
support the C M 3 domain as binding Clq. It is noteworthy that the 
residue analogous to Pro 436 in IgG is Pro 331 in the C y 2 domain, 
which is on the edge of the proposed IgG-binding site. Further, a 
Pro — > Ser mutation at this position is prominent in the non-Clq- 
binding IgG 4 isotype. The effect of the mutation in IgM is complex in 
that it renders .half of the mutant molecules incapable of binding Clq 
and the other half capable of binding but with lowered affinity. A 
model based on an equilibrium between different IgM conformations 
has been proposed to explain these observations (Wright et al, 1988). 
Another mutant (Asn — > Gin 402, comparable position Asn 297 in IgG, 
the C y 2 glycosylation site), which does not glycosylate, shows de- 
creased complement-activating ability compared to wild-type IgM 
(Muroaka and Shulman, 1989). 

The binding of IgM to antigen has been studied by electron micro- 
scopy and a model has been suggested for how extra Clq-binding 
sites would become available on antigen complexation (Beale and 
Feinstein, 1976; Feinstein and Richardson, 1981; Feinstein et al., 
1983, 1986). Briefly, pentameric uncomplexed IgM generally appears 
crudely as a star-shaped molecule with F(ab')2 unites emerging in 
various orientations from a planar Fc5 unit. On the binding of specific 
IgM to Salmonella paratyphi flagella, new staple-like IgM molecules 
are observed in which the F(ab')2 units are all dislocated out of the 
plane of the Fc5 disk and are bound to the flagella. Similarly, in an 
antidextran IgM/dextran system, the ability to activate CI correlates 
with the appearance of IgM bound to single molecules of dextran in the 
staple form. Therefore, it is suggested that some distortion introduced 
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into the IgM molecule in the star-to-staple transition reveals extra Clq 
citPi and triggers complement activation. 

However^s commented above, removal of the F(ab')* umts does 
noVreveal extra sites, implying the star-to-staple transition generates 
some change in Fc5 conformation. Again, the problems of transmitting 
changes through the domain structure of immunoglobulins have been 
used to argue against an allosteric mechanism. Instead it has been 
suggested (Feinstein et al, 1986) that pivoting about the inter-C M 3 
Se s and readjustment of the spatial relationship of neighboring 
C 4 dtmers might reveal extra Clq sites. Certainly protein engineering 
^perimrnts'ndicate the importance of the inter- C(i 3 bridges inyolv- 
fng CyT414, because the pentameric IgM Ser 414 mutant .s unable to 
activate complement. m 

Perhaps one of the most intriguing developments in this area re- 
cendy has been the demonstration in a number of laboratones of the 
nropensity of IgM to form hexamers, particularly in the ; absence of J 
cnaKs Cattanel and Neuberger, 1987; Davis et al, 1988; Randall et 
al, 1990). The hexamer is found to activate whole complement 10- to 
204old more efficientiy than the pentamer (Davis and Shulman, 1989, 
Randall et al, 1990); Because J chains are not necessary for either 
assembly or secretion of polymeric IgM from B cells it has even been 
suggested that their function may be io regulate the lytic efficiency of 
IgM by controlling the pentamer : hexamer ratio (Randall et al, 1990) 
The existence of both Clq and IgM as hexamers is striking and at least 
suggestive of some involvement of symmetry in the triggering process. 

5. Clq-Associated IgG at the Molecular Level 
The most detailed description available of associated IgG comes 
from two-dimensional crystallization studies of a monolayer of adim- 
trophenol (dnp)-lipid binding a mouse monoclonal anb-dnp IgGi an- 
tibody (Reidler et al, 1986). The antibodies form hexagonal arrays in 
which the Fes are dislocated out of the plane of die Fabs to generate ^an 
angle of about 80°. The monomers interact with one another via both 
Fc and Fab regions. We have constructed graphics representations 
based on this idta (Burton et al, 1989; Burton 1990a) when association 
of Fc regions was made most readily though interaction of Ae large 
hydrophobic patch at the CH2/CH3 domain interface. In fact, although 
the precise amino acids in this region are not conserved between IgGs 
of different species and subclasses, there is a strong conservation , of 
character as an extensive exposed hydrophobic patch. The patch is die 
recognition site for staphylococcal protein A, and the common reacbv- 
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ity of protein A with IgG is comprehensible in terms of its conservation 
(Burton, 1985). 

Two major features of the two-dimensional crystal studies are that 
the IgG molecule is viewed as dislocated and in Fc-Fc interaction. 
Independently, from studies on the interaction of IgG and Fc receptor, 
we suggested that IgG might be dislocated and in Fc-Fc interaction on 
an antigenic surface (Burton, 1986). We highlighted a possible link 
between IgG and IgM in their mode of binding Clq and complement 
activation (Fig. 10). Thus, IgM is normally in an associated state but 
must dislocate to bind Clq effectively. IgG is normally monomeric but 
on binding antigen it would form a defined polymer in a dislocated 
conformation, which would trigger complement. The complement- 
activating molecular species in the two cases is very similar according 
to this model. It is made more plausible by the description of hexa- 
valent IgM, which, in a dislocated conformation, would be expected to 
closely resemble the two-dimensional crystal view of hexameric IgG. 
Interestingly, hexagonal symmetry was also described some time ago 
from EM studies of Fc crystals (Pinteric et al., 1971). 

Experimental evidence on associated IgG conformation in less arti- 




Fie 10. Schematic representation of Clq binding to an array of IgG molecules. The 
IgG molecules are arranged in a hexameric anay as suggested by two-dimensional 
crystallization studies (Reidler et al, 1986). The Fes are dislocated (Burton, 1986) and 
roughly at right angles to the plane of the corresponding Fabs. For clarity, the IgG 
molecules are shown schematically in a ribbon format: the two-dimensional crystalliza- 
tion studies indicate lateral Fc-Fc and V domain interactions. The Clq molecule, 
roughly to scale, is shown interacting with two IgG molecules. Distortions of Clq would 
be required to recognize adjacent IgG molecules. 
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Fig. 11. The autonomous model of complement activation (Howard and Hughes- 
Jones, 1988). The scale model shows the binding of two Clq molecules to two adjacent 
Fc pieces. The two IgG molecules are bound to two different epitopes on the same 
antigen molecule. 



binding constant was decreased in the latter case. Furthermore, an 
F(ab')2 fragment of the IgG2a could not establish a full synergistic 
effect. These data imply some role for the Fc part of the IgG^ molecule 
in Clq binding and are perhaps indicative of stabilization of an anti- 
body array rather than autonomous binding. 

The human IgG subclass pattern seen in the binding of Clq to 
associated IgG roughly mirrors that seen for monomeric IgG. More 
Clq becomes bound to IgG 3 associated on hapten-coated red cells or 
on immobilized hapten-bovine serum albumin than to similarly asso- 
ciated IgGi (Bruggemann et al, 1987; Bindon et al, 1988a; Garred et 
al, 1989). However, the extent of IgG3 superiority depends on epitope 
density and complement. concentration. The difference appears to re- 
side in the number of Clq sites made available in the two cases, the 
binding constants being very similar. More sites are made available by 
the IgG 3m ( g) than the IgG 3m( b) allotype. There appear to be some very 
subtle antibody conformational or geometric requirements for the gen- 
eration of a site capable of binding Clq multivalently. Another study of 
two IgG t antibodies against different epitopes on the same synthetic 
branched polypeptide showed that, with equal amounts of antibody 
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bound to the polymer, one antibody bound severalfold more Clq than 
the other No difference in sequence in the hinge or C y 2 domains 
between the two antibodies was found (Horgan et al, 1990). This study 
again implies preferred antibody conformations or arrangements for 
productive Clq binding. 

Relatively significant Clq binding to associated lgG 2 is observed at 
higher epitope densities and complement concentrations whereas at 
lower ones such binding virtually disappears (Garred et al, 1989). 
These observations may account for earlier apparent inconsistencies in 
the literature. Associated IgG 4 is never observed to bind Clq signifi- 
cantly. 

Finally there are now two reports to show that associated rat IgA is 
capable of binding Clq and of activating CI, but this does not lead to 
C4 deposition or cell lysis (Bindon et a/., 1990; Hiemstra et al, 1990). 
There are conflicting reports about the ability of human IgA to take part 
in classical pathway activation (Iida et al, 1976; Burritt et al, 1977; 
Romer et al, 1980; Jarvis and McLeod Griffiss, 1989). 

B. CI Activation 

Attempts to understand how antibody activates CI are facilitated by 
increased information on the structure of CI (reviewed in Cooper, 
1985; Weiss et al, 1986; Arlaud et al, 1987; Schumaker et al, 1987). 
Briefly, there is ample evidence to indicate that the Clr 2 Cls 2 tetramer 
is very extended as an isolated molecule but that it compacts consider- 
ably when complexed to Clq in CI. Symmetrical models have been 
proposed in which the Cls-Clr-Clr-Cls tetramer is. wound within 
the stalks of Clq in an S or figure 8 shape (Fig. 12). Such models have 
the advantage that they bring the catalytic domains of Clr and Cls into 
contact with one another, thereby making it conceptually easier to 
understand how activated Clr can activate Cls. An asymmetric model 
(Fig. 13) that also achieves this feature has the tetramer folded at its 
midpoint and wrapped around the outside cone of Clq (Cooper, 1985). 
A further set of asymmetric models has been proposed (Perkins, 1985). 

The interesting question for this review is how Clq binding to 
associated IgG (or antigen-complexed IgM) leads to CI activation. The 
evidence seems to favor a distortive model in which binding to an array 
of Fes distorts the cone formed by the spreading Clq arms. This leads 
to autoactivation of Clr, which in turn activates Cls. The best evidence 
for this viewpoint comes from the description of a mouse monoclonal 
Igd antibody [or F(ab') 2 fragment], which is against an epitope on the 
collagenous arms of Clq and which is able to activate CI (Hoekzema et 
al, 1988). 
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Fic. 12. Model of complement Cl (adapted from Arlaud et ai, 1987). The upper 

diagram shows a model of the extended conformations of Clr 2 CIs 2 and how a "figure . 
8-shaped" conformation could be acquired on complexation with Clq. Ir, Is, Interaction 

dcmains of Clr and Cls; Cr, Cs, catalytic domains of Clr and Cls. The lower diagram ti« 

models the Cls-Clr-Clr-Cls tetramer interacting with Clq. * a( 

ai 
b. 

This study shows that (1) bivalency of the antibody is a requirement tc 

for C 1 activation but not for binding to Clq; (2) increasing the segmen- ir 

tal flexibility of the antibody by reduction and alkylation of hinge a( 

disulfides destroys the ability to activate Cl ; (3) an antibody against the P 

Clq heads inhibits Cl activation by associated antibody but not by the C 
anti-Clq arm antibody; (4) isolated Clq stalks (Clq with the heads 

digested away) are still activated by the monoclonal antibody, indicat- ir 

ing the heads are not the origin of the activating signal; and (5) Cl e ' 

activation is optimal at a monoclonal antibody : Clq ratio of 3:1. The c; 

data can be readily interpreted in terms of a symmetrical model a: 

wherein the monoclonaJ antibody distorts a pair of Clq arms to bring a 

Clr and Cls catalytic subunits together in space. In this context it is ^ 

known that dimers of IgG will activate complement (Wright et ai, [ ] 

1980) and therefore by implication the binding of two heads on the i! 

same Clq molecule is sufficient for Cl activation. a 
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Fic. 13. Alternative model of CI (adapted from Cooper, 1985). The Clr 2 Cls 2 tetra- 
mer wraps around the arms of Clq rather than being intertwined. 

A second suggestion has been that associated antibody may activate 
CI by release from the action of CI inhibitor. According to this theory 
CI inhibitor, normally regarded as functioning by actively disassem- 
bling activated CI to give aCHnh-Clr-Cls-CHnh complex binds to 
unactivated CI and prevents activation. Antibody displaces CI inhibi- 
tor and therefore activation proceeds. However, the ability of CI inhib- 
itor to bind to unactivated CI has been questioned and earlier observa- 
tions have been interpreted in terms of CI inhibitor binding solely to 
activated CI (Bianchino et ai, 1988). Generally, it seems that CI can 
autoactivate by both inter- and intramolecular catalysis, but there has 
been controversy over the years about how much importance to attach 
to the mechanisms. A recent study (Hosoi et al, 1987) suggests that 
intramolecular spontaneous activation is very slow but intermolecular 
activation can be rapid. In this second case, typically, contaminating 
proteases convert a little C 1 to activated C 1 , which then cleaves further 
CI molecules. CI inhibitor acts solely on the latter process. 

Tight Clq binding is not a guarantee of CI activation. For exaniple, 
in the case of erythrocytes sensitized with IgG or IgM, both bind CI 
equally well but the rate of activation of CI is far greater in the latter 
case (Colten et ai, 1969). Glutaraldehyde-cross-linked IgG binds Clq 
as effectively as immune complexes but fails to active CI (Folkerd et 
ai, 1980). One of the most detailed investigations of the relationship 
between Clq binding and CI binding/activation is that of Bindon et al. 
(1988b). These authors compared the ability of rat IgG isotypes bind- 
ing to the human lymphocyte antigens CAMPATH-1, MHC class I, 
and LCA (leukocyte common antigen) to bind Clq, bind and activate 
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CI, and mediate cell lysis. These antigens have comparable surface 
densities but show marked differences in lytic ability (CAMPATH-1, 
MHC class I >> LCA). It was found that Clq binding was roughly 
proportional to antibody binding and dependent on antibody isotype. 
However, the lytic antibodies were able to bind and activate more CI 
than were the poorly lytic ones. C3 activation and whole lysis patterns 
then propagated these Gl activation differences. The authors sug- 
gested two factors that might play a role in this "antigen effect," i.e., 
preferential CI binding associated with antigens promoting lysis. One 
was the possibility of antibody-Clr2ClS2 interactions (see below). 
The other was the mode of presentation of antigen-complexed anti- 
body to CI. The authors noted the lower arm flexibility of CI relative to 
Clq (Schumaker et aL, 1987), which might place more rigorous bind- 
ing requirements on the former. 

Therefore, Clq binding and CI activation need not correlate when 
comparing different antigens. For a given antigen, from the limited 
studies to date, they generally do correlate, e.g., human IgG isotypes 
binding to hapten-coated red cells (Bindon et aL, 1988a), rat IgG 
isotypes binding to LCA (Bindon et aL, 1987), and rat IgG isotypes 
binding to hapten-coated red cells (Bindon et aL, 1990). In particular, 
human IgGa binds Clq and activates CI more effectively than does 
IgGi. 

As above, a feature of CI activation receiving some interest is the 
possibility of an interaction between antibody and the Clr2Cls2 tetra- 
mer. A review of the literature in 1985 (Burton, 1985) showed no direct 
evidence for such an interaction involving IgG, and this is still so. 
Similar conclusions have been reached for IgM (Poon and Schumaker, 
1991). However, a number of interesting observations have been 
made. First, as described initially by Reid et aL (1977), the binding 
affinity of Clq for Clr 2 Cls2 is increased by about an order of magni- 
tude on binding to immune complexes (Cooper, 1985). Second, the 
dissociation rate of Clq in the activated CI complex from sensitized 
red cells is about 10-fold slower than that for Clq alone. This applies to 
a high antibody density, the difference narrowing at lower densities 
(Okada et aL, 1985). Third, the rate of CI activation oh a red cell surface 
is dependent on antibody density and is independent of antigen or CI 
density (Hughes-Jones et aL, 1985). Fourth, the lowered dissociation 
rate for Clq in activated CI is not found when rabbit Facb (lacking the 
Cy3 domains) is used instead of IgG. The dissociation rate of Clq alone 
is the same for IgG and Facb (Okada et aL, 1985). Fifth, the Facb- 
bound activated CI complex is more susceptible to CI inhibitor inacti- 
vation than is the IgG-bound complex. The results have been inter- 
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nreted to indicate a direct, albeit weak, interaction between Clr 2 Cls 2 
and the C^ domains of IgG, with these domains protecting the acti- 
vated tetramer from CI inhibitor (Okada and Utsumi, 1989). 

An alternative is that in a Cl-activating situation the conformations 
of both antibody (IgG) and Clq (in CI) become constrained in the 
mutual interaction. The antibody becomes arranged in a defined array 
in which Fc-Fc interactions play a part so that Facb is less effective. 
Array formation is facilitated at higher antibody densities. The Clq 
undergoes a distortion that increases its affinity for Clr 2 Cls 2 . The 
distortion is different depending upon the presence or absence ot 
Clr 2 Cls 2 bound to Clq, leading to a difference in dissociation rates as 
described above. These two alternatives are not mutually exclusive, 
i e there could be tetramer binding to antibody and array formation. 

Finally, recent sedimentation studies show that activated CI binds 
much more tightly to IgM than does Clq alone (Poon and Schumaker, 
1991). It is argued that the binding of activated Cls 2 to Clq, either 
alone or together with activated Clr 2 , induces a conformational change 
in Clq that results in additional Clq heads binding to complementary 
sites on IgM. Cryptic sites on IgM, transitorily exposed by random 
thermal motion, might be "captured" by activated CI, forming a new 
complex that could mimic activated CI attached to cell-bound IgM. 

C. C4 Activation, C3 Activation, and Cell Lysis 
After CI activation, the next step in the classical complement path- 
way is the activation ofC4 through proteolytic cleavage by activated 
Cls (schematic structures for these molecules are shown in Fig. 14). 
The major fragment, C4b, can attach covalently to a suitable surface via 
an activated acyl group. The Fab of IgG (Campbell et al, 1980; Alcolea 
et al, 1987) and the antigen (Garred et al, 1990) have been implicated 
as the surface for C4b deposition in immune complexes.. Studies on 
antibody-coated red cells have found C4 deposition primarily at the 
cell membrane (Circolo and Borsos, 1982, Bindon et al, 1988a). A 
number of studies have investigated isotype patterns of C4 activation. 
Bindon et al, (1988a) looked at the amount of C4b deposited on NIP- 
coated red cells in complement activation by chimeric human anti "NIP 
antibodies. They found that IgGi deposited far more C4b than did 
IgG 3 under conditions in which IgG 3 was more efficient at Clq bind- 
ing and CI activation, as described earlier. The authors showed that 
the poor C4b deposition was due to poor C4 activation rather than 
inability to reach the cell surface, i.e., Cls activated by IgC 3 appeared 
inefficient at C4 activation. Possible explanations suggested included 
easier access of CI inhibitor in the IgG 3 case, favored association of C4 
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Fic. 14. Schematic representation of the molecules involved in the classical pathway 
activation of complement. The molecules are drawn roughly to scale. Dimensions and 
shapes axe taken from Reid and Porter (1981), Perkins (1985), Perkins et al (1990a,b), 
and Odermatt et al. (1981). 



in the IgGi case, and restricted access of C4 to Cls in the IgGa case. No 
C4b binding was detected with either IgC 2 or IgG 4 . 

Garred et al. (1989) looked at C4b deposition on NIP-BSA immune 
complexes formed with the chimeric human anti-NIP antibodies. In 
particular, they studied the IgG subclass patterns as a function of 
epitope density (NIP:BSA) and the concentrations of NIP-BSA, anti- 
body, and complement. They found that IgGi and IgG3 were compa- 
rable in C4b deposition at higher epitope density and BSA-NIP con- 
centration but that IgGi was far less effective at lower values of these 
parameters. IgG2 produced significant C4b deposition at higher epi- 
tope density and BSA-NIP concentration but IgG 4 was ineffective 
under any conditions. In this study, C4b deposition patterns followed 
those of Clq binding. It is unclear why the two studies produce a 
different rank order for IgGi and IgG 3 . It could be that IgGi is gener- 
ally more effective for cell surface activation and IgGa for immune 
complex activation. This interpretation is rendered unlikely by the 
studies of Michaelsen et al. (1991) on isotype patterns in cell lysis (see 
below). A more probable explanation lies, at least in part, in the man- 
ner of presentation of the hapten, and we shall return to this point. 
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Bindon et al (1990) looked at C4b deposition on NIP-coated red 
cells using different isotypes of rat anti-NIP antibody. They found that 
IgM and IgG 2 b were both very efficient, reflecting their efficacy in Clq 
binding and CI activation. However, of IgGi, IgG^, IgG 2c > and IgA, 
which bound Clq and activated CI roughly equivalently, only IgG 2a 
was effective at C4 activation under the conditions employed. 

Fixed C4b interacts with C2 to form the classical pathway C3 conver- 
tase, which in turn cleaves C3. The C3b product can bind covalently to 
an appropriate surface in a manner analogous to C4b. Again this sur- 
face can be antibody (Gadd and Reid, 1981b; Brown et al, 1983; Takata 
et al, 1984), the antigen part of an immune complex (Garred et al, 
1990), or a cell membrane (Circolo and Borsos, 1984; Bindon et al, 
1988a). The available studies generally indicate a good agreement 
between efficiency of C4b deposition and efficiency of C3 deposition 
and cell lysis or terminal complement complex formation (Bindon et 
al, 1988a, 1990; Gan-ed et al, 1989). However, there is a striking 
exception. Clark et al. (1989a) found that a rat monoclonal anti-CD3 
IgG 2 b antibody with one nonfunctional light chain (and therefore 
monovalent) generated similar levels of cell-bound C3 but gave more 
lysis than did the parent divalent antibody. It would seem, therefore, 
that the triggering antibody molecule can influence the complement 
cascade at a stage after C3 deposition. Considering the interesting 
question of the relative abilities of IgG t and IgG 3 to mediate cell lysis, 
the following observations have been made. IgGi is considerably more 
effective than IgG 3 in mediating lysis of hapten (NlP)-coated red cells 
(Briiggemann et al., 1987; Bindon et al, 1988a). IgG 3 is more effective 
at mediating lysis of dansyl-coated red cells (Dangl et al, 1988). An IgGi 
directed against a surface antigen is more effective at lysis of lympho- 
cytes than is IgG 3 (Riechmann et al, 1988). An IgGi against a tumor 
cell line mediates lysis (Liu et al, 1987), whereas an IgG 3 against a 
different cell line does not (Shaw et al, 1987). The picture apparently 
emerging is therefore of IgGi being generally superior for lysis. 

However, the studies of Michaelsen et aZ. (1991) on red cells labeled 
with hapten (NlP)-anti-red blood cell (RBC) F(ab') suggest that the 
conditions of lysis, i.e., antigen density and antibody and complement 
concentration, are an important consideration here. Thus, they found 
anti-NIP IgGi to be superior to IgG 3 at high antigen concentration, but 
this order was reversed at lower antigen concentration. The amount of 
IgG 3 bound was less than that of the other subclasses so its relative 
potency is even greater. IgG 2 was only effective at the highest antigen 
densities. This latter observation is interesting in view of the pre- 
ponderance of IgG 2 antibodies in anticarbohydrate responses and the 
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tendency of carbohydrate antigens to be presented in high concentra- m 

tion on the surface of microorganisms. These studies did not determine ti 

where in the Clq binding, CI activation, and C3 and C4 activation (1 

steps the relative efficacy of the subclasses is established. The pre- b« 

vious studies on immune complexes (Garred et al, 1989) make the w 

Clq-binding step a likely candidate. a< 

Can the various studies on the relative efficacies of IgG3 and IgGi be cl 

reconciled simply on the basis of different experimental conditions? n< 

We believe probably not and the differences probably arise from dif- al 

ferences in presentation of the antigen to the complement system. In al 
particular, Briiggemann et al. (1987) and Bindon et al. (1988a) used 
NIP attached to kephalin and therefore, presumably, the hapten was 

close to the red cell surface, whereas Michaelsen et al. (1991) attached n; 

NIP to cell surface proteins directly or via Fab' at greater distances in a ™ 

different local environment. Therefore, in conclusion, it does not yet >r 

seem possible to assert that IgG t or IgG 3 is the most effective isotype U 

for complement lysis. This may depend on the antigen involved and a( 

the precise experimental conditions employed. C( 

Finally, it should be noted that in the case of lysis mediated by a pair tr 

of rat IgGs against the human leukocyte common antigen, efficient P 

lysis required -the alternative as well as the classical pathway (Bindon j" c 

et al, 1987). J> 

D. Antibodies as Activators of the Alternative Pathway ti 

Alternative pathway activation, in its simplest form, involves the 
generation of the alternative pathway C3 convertase (C3bBb) from C3 
and factor B in the presence of factor D. The activation, once initiated, 
has the potential for positive feedback and amplification. Factors I and 
H in the fluid phase act to regulate C3 convertase and prevent amplifi- 
cation. Properdin acts as a positive regulator. In the presence of a 
suitable activating surface, stabilization of the C3 convertase tips the 
scales in favor of amplification. A typical activating surface is that of a 
microorganism, although it is suggested that associated antibody may j 
also function in this way (reviewed in Ratnoffer al y 1983). 

In particular, it is often assumed that IgA can activate the alternative j. 
pathway, although this has been controversial (Kilian et al., 1988). ^ 
Recent literature maintains the controversy. Hiemstra et al. (1987) j 
reported that red cells coated with chemically aggregated human IgA 
were lysed by the alternative pathway. Rits et al (1988) found that both 
soluble and insoluble rat IgA immune complexes activated the alterna- 
tive pathway of homologous rat complement. Hiemstra et al (1988) p 
reported that human IgAi, IgA 2> secretory IgA, and the F(ab')2 frag- 
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m ent of IgAi coated onto microwells were able to activate the alterna- 
Hve nathway. Fab and Fc fragments were not. Schneiderman et a Z. 

99oTSed that a series of chimeric rabbit mouse IgA antibod.es 
hound to antigen activated the alternative, but not the class.cal path- 
way ^ Vahm and Lachmann (1991) found effective alternative pathway 
Tctivatl by immune complexes formed between BSA-NIP and 
cCeric human lgA 2 anti-NIP antibodies. The same •complexes did 
no trigger the classical pathway. lgG 2 was also found to activate the 
Xemative pathway less effectively at high epitope density and equiv- 

alence or antibody excess. 

On the other hand, Imai et al (1988) found that neither human IgA 
immune complexes nor covalently cross-linked IgA activated the alter- 
native pathway. Russell and Mansa (1989) reported that although hu- 
man IgA coated onto plastic surfaces activated the alternate pathway 
in a dose-dependent manner, IgA bound to antigen dul not. In contrast. 
IgG antibodies, either bound to antigen or coated directly onto plastic, 
activated complement, mainly by the classical pathway. The authors 
concluded that the complexation of IgA with antigen is insufficient to 
trigger the alternative pathway and argued rather ^t d enaturat.on 
plays a key role in IgA activation. Jarvis and McLeod Cnffiss (1989) 
found that human IgA, was unable to mediate alternative pathways 
lysis of Neisseria meningitidis but did mediate classical pathway lysis. 
The other antibody often associated with alternative pa^ay activa- 
tion is rabbit IgG when C3b has been found bound to Fab and the i^i 
domain of IgG (Gadd and Reid, 1981a; Anton et al, 1989). 



IV. Human Leukocyte Fe Receptors 

Receptors specific for the Fc region immunoglobulins are found on 
the surface of a variety of human leukocytes. The presence ot fc 
receptors confers on these immune cells the ability to mediate a num- 
ber of effector mechanisms important in the humoral response. Hecent 
cloning and sequencing studies have revealed that the large majority 
of mammalian Fc receptors have evolved as part of the immunoglobu- 
lin gene superfamily. Here those Fc receptors specific for IgO U-c, 
receptors) will be discussed first, followed by those specific for other 
classes of immunoglobulin. 

A. Fc v Receptors 

Three classes of human Fa, receptor have thus far been described: 
Fc^RI, Fc T RII, and Fc^RIII (Unkeless et al, 1988). All three ap- 
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pear capable of mediating a number of protective functions against 
antibody-coated infectious agents (Pound and Walker, 1990; Van de 
Winkel and Anderson, 1991). Studies using hybridomas expressing 
surface anti-Fc^R monoclonal antibodies as targets have demonstrated 
that Fc y RI, Fc^RII, and the macrophage/NK cell form of Fc^RIII can 
mediate antibody-dependent cell-mediated cytotoxicity (ADCC) (Fan- 
ger et al, 1989). These same receptors can also mediate phagocytosis 
(Huizinga et al, 1989c; Anderson et al, 1990a), whereas Fc^RI, 
Fc^RlI, and the neutrophil form of Fc^RIII have been shown to trigger 
an oxidative burst (Anderson et al, 1986; Crock ett-Torabi and Fan- 
tone, 1990; B. A. M. Walker et al, 1991). 

1. Fc y Rl 

Human F^RI (CD64) is a 72-kDa glycoprotein expressed consti- 
tutively on monocytes and macrophages. It may be induced on neutro- 
phils in vitro by treatment with interferon-y (IFN-y) (Perussia et al, 
1983). IFN-y treatment also up-regulates Fc^RI expression on mom> 
nuclear phagocytes on the FCyRI-bearing monocytic cell lines, U937 
and HL60 (Guyre et al, 1983). 

Three cDNA clones for human Fc y RI have been isolated using a 
ligand-affinity cloning technique (Allen and Seed, 1988, 1989). Two 
clones represent polymorphisms whereas the third has a shorter pre- 
dicted intracytoplasmic domain (Fig. 15). In each case, the deduced 
amino acid sequence indicates an integral membrane protein with a 
single hydrophobic membrane-spanning region. The extracellular 
portion is composed of three immunoglobulin-like domains, the first 
two of which exhibit homology to the two extracellular domains of 
human Fc 7 RII and Fc^RIII (see later). The third domain, nearest to the 
membrane, is less closely related, 

2. Fc y RI~IgG Interaction at the Molecular Level 

FCyEl is sometimes referred to as the high-affinity Fey receptor, 
because of the three human receptors it is the only one displaying 
appreciable affinity for monomeric IgC. It binds monomelic human 
IgC,andIgG 3 withaK a of-5 x 10 8 M~ l (Fries et al, 1982; Kurlander 
and Batker, 1982). The affinity for human IgC 4 is approximately 10-fold 
lower and human IgG 2 does not bind ( Woof et al, 1986). Human Fc^RI 
appears to bind aggregated IgC, at least in heat or chemically cross- 
linked forms, with similar affinity to monomeric IgG (Cosio et al, 
1981; Carter et al, 1982; Kurlander and Batker, 1982; Woof et al, 
1986). This finding, together with the fact that the high serum concen- 
tration of monomeric IgG presumably results in constant saturation of 
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pic 15. Schematic representation of the structures of human Pc, receptors The 
extracellular immunoglobulin-like domains, shown as oval shapes, each have an interna 
disulBde bond. No structural information is yet available for cytoplasm* doma.ns. but 
their relative lengths are indicated. PIC, Phosphatidylinositol-glycan. 



FcJU, raises the question of how Fc y RI is able to distinguish antibody- 
coated infectious agents. One possibility is that Fc 7 RI may play an 
important role at tissue sites where monomeric IgG is limiting. Alter- 
natively, up-regulation of Fc^RI expression by IFN-7 at inflammatory 
sites may influence the function of the receptor. 

The high affinity of Fc^RI for monomeric IgG has facilitated at- 
tempts to localize the Fc^RI interaction site on IgG. Earlier studies had 
suggested that the site lay in the domain of IgG (Okafor et al, 1974; 
Ciccimarra et al, 1975). However, the use of highly immuno- 
affinity-purified IgG fragments and domain-deleted IgG paraproteins 
indicated that this was not the case (Woof et al, 1984). Further, loss of 
the N-linked carbohydrate moieties from the C y 2 domain ot IgG was 
shown to result in a marked reduction (>50-fold) in affinity for Fc^RI. 
This suggested an important role for the C r 2 domain in the interaction, 
because aglycosylation was deemed more likely to perturb struc- 
ture than that of CJJ (Leatherbarrow et al, 1985; Walker et al, 1989a). 
As mentioned earlier, recent NMR studies have verified this assump- 
tion (Matsuda et al, 1990). 

Direct evidence for the involvement of the domain of IgG was 
provided by experiments in which a panel of antihuman IgG monoclo- 
nal antibodies were assessed for ability to inhibit the IgG-Fc y RI inter- 
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action and to bind to receptor-bound ligand (Partridge et al, 1988). 
Only those monoclonal antibodies recognizing epitopes atthe N- 
terminal end of the C y 2 domain both blocked IgG binding to Fc^RI and 
failed to bind receptor-bound IgG. Recent "domain swap" experi- 
ments in which the C r 2 and Cy3 domains of human IgGi, either singly 
or together, substitute for one or both of the homologous C € 3 and C t 4 
domains of mouse IgE have generated chimeric IgG/IgE molecules 
(Shopes et al, 1990). In the series of mutants produced, all those 
lacking domains did not inhibit the FcyRI-IgG interaction, 
whereas three out of four IgG/IgE constructs possessing domains 
did inhibit. These results confirm the importance of the C r 2 domain for 
interaction with FcyRI. Futher, calculations of the relative energetic 
contributions of each domain revealed that the C y 2 domain contributes 
about 73% of the overall drop in free energy seen on binding. In 
contrast, the C y 3 domain was shown to contribute maximally about 
25% of the free energy drop upon binding. This is consistent with a 
requirement for the presence of the Cy3 domain of IgG for Fc r RI 
binding, earlier suggested by the lack of reactivity of a C r 3-deleted 
human IgGi paraprotein with the receptor (Woofer al, 1984). The C y 3 
domains would appear to serve to stabilize the overall structure of the 
Fc, necessary for optimal FCyRI binding. 

Another approach assessed the ability of various IgG molecules, 
from different species and of different subclasses, to inhibit the inter- 
action of radiolabeled human IgG with Fc y RI. Amino acid sequence 
comparison of the C y 2 domains of these IgGs then outlined potential 
Fc y RI-binding sites that fulfilled the requirements of solvent accessi- 
bility and conservation in only the tight binding IgGs. This procedure 
highlighted a region at the N-terminal end of C y 2 comprising residues 
Leu 234-Ser 239 which appears to be critical for interaction with 
Fc Y RI (Woof et al, 1986). This region, although encoded by the C y 2 
exon, structurally forms part of the lower portion of the hinge, lying 
mostly beyond the extent of the solved crystal structure of Fc, which 
stops at residue 238 (see earlier). 

The proposed site, Leu 234-Leu 235-Gly 236-Gly 237-Pro 238- 
Ser 239, is present in all IgG isotypes with high affinity for Fc r RI, 
namely, human IgGi and IgC 3 , mouse IgGza, rat IgG 2b , and rabbit IgG^ 
In the weaker binding human IgG 4 , residue 234 becomes Phe and in 
mouse IgG 2b residue 235 becomes Glu. This latter IgG displays little 
or no binding to human Fc T RI. However, a mutant mouse IgG ? b mole- 
cule, generated by site-directed mutagenesis in which Glu 235 is 
converted to Leu, displayed a greater than 100-fold increase in affinity 
for the receptor. Indeed, Scatchard analysis of direct binding measure- 
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ments showed it to have an affinity comparable to that of human IgGi 
(Duncan et al t 1988). More recent mutagenesis experiments have 
concentrated on the high-affinity human IgG 3 subclass. Substitutions 
at residues 234-237 inclusive resulted in reductions in ability to in- 
hibit the IgG-Fc^Rl interaction (Lund et a/., 1991). The most marked 
effect is seen with substitution at position 235, with replacement of the 
Leu with Glu giving a > 100-fold decrease in affinity. Substitution of 
Ala for residues Leu 234, Gly 236, and Gly 237 generated antibodies 
with affinities reduced -4-fold, -4-fold, and -30-fold respectively. 
Rosetting studies with U937 cells confirmed these binding inhibition 
measurements. Taken together, these findings confirm the ^importance 
of the lower hinge region in the interaction of IgG with Fc^RI (Pig. 
2). Evidently, only a single site (i.e., one heavy chain) is required 
for interaction with a single Fc^RI molecule (O Grady et al, 1986; 

Koolwijk et aL, 1989). . 

Studies using an array of further IgG mutants appear to be in broad 
agreement with the above-cited work (Canfield and Morrison, 1991). A 
mutant human IgG 2 molecule, in which the C y 3 domains of human 
IgG 3 were substituted for its own, did not bind to Fc Y RI. A human IgG 3 
bearing the C y 3 domains of human IgG 2> however, did bind Fc y RI, 
with an affinity comparable to that of wild-type IgC 3 . Thus, the critical 
importance of the C y 2 domains in the interaction is reaffirmed. Substi- 
tution of Glu for Leu at position 235 in human IgG 3 again resulted in a 
> 100-fold reduction in affinity for Fc y RI. Conversion in human IgG 3 of 
residue 234 to Phe, as found in human lgG 4 , generated a mutant 
molecule with affinity equivalent to that of wild-type human IgG 4 . The 
reciprocal experiment, in which Phe 234 in human IgG 4 was converted 
to Leu, generated an IgG 4 molecule with increased affinity for Fc^RI, 
about threefold lower than that of human IgG 3 . This slight shortfall in 
affinity compared to human IgC 3 prompted the authors to search else- 
where in the C^ domain for amino acid differences that might serve as 
an explanation (Canfield and Morrison, 1991). Thus it was proposed 
that residue 331 (Pro in all tight-binding IgGs but Ser in human IgG 4 ), 
located on a loop lying close to the lower hinge site, might make some 
contribution to the Fc r RI interaction. The substitution experiments, 
however, yielded somewhat inconclusive results and further investi- 
gation into the possible role of this loop would be interesting. 

A second bend, lying close to the lower hinge region in molecular 
models, may also be of importance. Proton NMR revealed that on 
aglycosylation of Fc the minor structural changes in the C y 2 domain 
were sensed predominantly by His 268, which lies in this second bend 
(Matsuda et al, 1990). The marked reduction in affinity for Fc 7 RI 
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accompanying aglycosylation may be explained in terms of perturba- 
tion of this lower hinge-proximal loop (Lund et al, 1990). 

Turning to the interaction site on the receptor, presently rather little 
information is available. The third domain of Fc y RI shares less homol- 
ogy with the two extracellular domains of the low-affinity receptors, 
FcyRII and FoyRIII, than do the first two domains. This perhaps sug- 
gests a role of this third domain in conferring high affinity on Fc^RI. 
Indeed, preliminary studies introducing point mutations into the third 
domain are reported to confirm this (Allen and Seed, 1989). Further, 
studies on the mouse homologue; mouse Fc^RI, which also possesses 
three extracellular domains, suggest that domain 3 confers high affinity 
and specificity on domains 1 and 2, which form a low-affinity IgG- 
binding motif (Hulett et al, 1991). 

Figure 16 shows a model of the interaction of human FcyRI 
(modeled as three IgG domains) with an IgGi molecule simulta- 
neously binding to antigen (hemagglutinin on the surface of a virally 
infected cell). Although admittedly speculative, such models serve to 
demonstrate the relative sizes of the molecules involved and highlight 
conformation constraints imposed by the location of interaction sites 
on the molecules. 

3. FCyRU 

Human Fc 7 RII (CD32), a 40-kDa glycoprotein, has the most wide- 
spread distribution of the three human receptors for IgG, being 
present on monocytes, macrophages, eosinophils, platelets, neutro- 
phils, basophils, and B cells. Several cDNA clones coding for this 
receptor predict a transmembrane molecule with two immuno- 
globulin-like extracellular domains (Stuart et al, 1987, 1989; Hibbs et 
al, 1988; Stengelin et al, 1988). Fa.RH is encoded by a minimum 
of three genes (IIA, IIB, and IIC) that yield six distinct transcripts 
(Brooks et al, 1989). Gene IIB gives rise to three transcripts, termed 
Fc 7 RIIbl, Fc Y RIIb2, and Fc 7 RIIb3, by alternative slicing of cytoplas- 
' mic exons or of signal sequence exons. Transcripts of IIB have been 
demonstrated by a combination of Northern blotting and polymerase 
chain reaction (PCR) amplification after reverse transcription in macro- 
phages, monocytes, and B lymphocytes, with lower levels found in 
neutrophils (Brooks et al, 1989). Fc 7 RIIb expression has also been 
demonstrated in placental trophoblasts (Stuart et al, 1989). 

The three other Fc y RII transcripts (two FCyRIIa transcripts and one 
FoyRIIc) are derived from two further genes referred to as FCyRIIA and 
Fc 7 RIIC (Brooks et al, 1989; Stuart et al, 1989). Alternative polyade- 
nylation in Fc^RIIA gives rise to two transcripts. All three molecules 
are highly homologous to Fc 7 RIIb, with differences being restricted to 
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Fic. 16. Antibody as adaptor linking target and effector cells. In the lower part of the 
picture, the target cell infected by influenza virus is shown expressing two hemaggluti- 
nin molecules at its surface. Hemagglutinin (Wilson et ai, 1981) is recognized by the 
Fab arms of the IgG molecule. In turn, the IgG molecule is recognized at the site seen in 
Fig. 2 by an Fc receptor molecule anchored to the cell membrane of the effector cell. The 
IgG molecule as shown is dislocated in the sense that the Fc part is in a plane roughly at 
right angles to that containing the Fab arms. The Fc receptor, FcRI, is known from the 
work of Allen and Seed (1989) to consist extracellularly of three immunoglobul in-like 
domains. The structure as shown and antibody binding to the outermost domain are 
speculative. The effect of opening of the Fab arms (opening the Y) and of dislocation is 
that the antibody molecule tends to draw the target and effector cells closer together than 
many other possible arrangements. The use of one of the inner domains of FcRI for 
antibody binding would enhance this effect still further. 
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the signal sequence and part of the cytoplasmic domain in Fc^RIIa, 
and to a portion of the cytoplasmic domain in Fc^RIIc. Ignoring differ- 
ential polyadenylation, the Fc y RIIa and Fc^RIIc transcripts are almost 
identical apart from their signal sequences. The leader sequence of 
Fc^RIIc is homologous to that of FcyRIIb, whereas that of Fc^RIIa 
shows homology to that of human Fc^RIII (see later). Thus, IIC may 
have evolved initially by gene duplication and mutation from an ance- 
stral IIB gene, with a later recombination event between IIC and 
FcyRIII genes giving rise to IIA. 

Transcripts of IIA have been found readily in monocytes and neutro- 
phils but less prominently in lymphoid cell lines, whereas sensitive 
PCR amplification techniques have detected FcyRIIc transcripts in 
neutrophils, monocytes, and B cells (Brooks et al, 1989). Thus, there 
appears to be preferential expression of FCyRIIA in neutrophils and 
Fc r RIIB in lymphocytes, with both forms being expressed in mono- 
cytes (Ravetch and Anderson, 1990). 

The mature protein products predicted from the cDNAs of Fc^RIIa, 
Fc 7 RIIc, and Fc^RIIb are closely related. The predicted Fc^RIIa and 
Fc^RIlc proteins are virtually identical (>95%). The extracellular and 
transmembrane domains of the Fc^RIIb receptors share a high degree 
of homology with the predicted FcyRIIa and FcyRIIc receptors. 
However, beyond the first 10-12 residues of the intracellular regions 
the similarity between Fc^RIIb and FcyRIIa/IIc ends and highly di- 
vergent cytoplasmic tails result (Fig. 15). The conserved nature of the 
extracellular domains together with the diversity of the intracytoplas- 
mic regions suggests that this family of receptors may mediate an array 
of different functions in response to recognition of the same ligand. 

4. Fc y RU-lgG Interaction at the Molecular Level 

The affinity of human Fc^RII for monomeric IgG is too low to be 
easily experimentally determined (K* < 1 x M^AT* 1 ) (Karas et al, 
1982; Kurlander et al, 1984) but binding to complexed IgG is much 
tighter [fordimers of human Igd, K fl - (2-5) x 10 7 M" 1 1 (Karas et al, 
1982; Van de Winkel and Anderson, 1991). Recently, the affinity of 
Fc^RII for complexed IgG has been shown to increase after treatment 
with proteolytic enzymes such as pronase and elastase (Van de Winkel 
et al, 1989b). This process may serve to "activate" FcyRII at inflam- 
matory sites (Tax and Van de Winkel, 1990). 

Human Fc^RII is specific for complexes of human Igd and IgG 3 and 
appears not to bind IgG 4 (Karas et al, 1982; Walker et al,. 1989b; Van 
de Winkel and Anderson, 1991). Reports on the affinity of Fa,,RII for 
complexed human IgG 2 appear to give conflicting results, with some 
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workers reporting binding (Van de Winkel and Anderson, 1991) and 
Others not (Walker et al, 1989b). An explanation, now emerging, is that 
T e ability to bind human IgG 2 appears to reside with only one allelic 
farm of FcJUI. termed low responder (see below) (Warmerdam et al, 
1091) FcJUI also has affinity for mouse IgC 2b but not for mouse IgG 2a . 
A functional polymorphism of Fc.RlIa has been demonstrated by dif- 
ferent assays on monocytes (Tax et al., 1983; Leeu iwenberg et al. 1987; 
Van de Winkel et al, 1987, 1989a; Debets et al, 1990), neutrophils 
(Leeuwenberg et al, 1990; Gosselin et al, 1990), and alveolar macro- 
phages (Kindt et al., 1991). The two allelic forms of ^ Rlla f ! nv ° k lve , d 
differ in their ability to bind complexed mouse IgGi-the hign- 
resDonder (HR) form binds whereas the low-responder form (LR) does 
not (Anderson et al, 1987). A monoclonal antibody 41H16, has been 
shown to be specific for the HR form (Gosselin et al, 1990). 

Recendy cDNA clones coding for HR and LR forms of Fc.RIIa have 
been isolated and sequenced (Clark et al, 1989b; Warmerdam et al, 

1990) . The alleles appear likely to differ in just two residues; residue 
27 in domain 1 is Gin in HR and Trp in LR forms, and residue 131 in 
domain 2 is Arg in HR and His in LR forms. Domain swap experiments 
between the two forms indicate that Arg 131 is a critical requirement 
for the binding of mouse IgGi to the HR form (Warmerdam et al., 

1991) . Arg 131 also appears to form part of the epitope recognized by 
41H16. Further, these studies indicate that His at residue 131 in the 
LR receptor is essential for the binding of this form to human IgG 2 . 

Loss of the N-linked carbohydrate moieties from the C y 2 domain ot 
human IgGi and IgG 3 molecules results in a marked reduction in their 
ability to interact with Fc 7 RIl (Walker et al, 1989b). This effect is 
reminiscent of that seen with F^RI and therefore suggests the impor- 
tance of the domain in the Fc y RII-IgC interaction. The^ localized 
perturbation on aglycosylation particularly in the vicinity of His 2bo\ as 
assessed by the proton NMR methods mentioned earlier (Matsuda et 
al, 1990), perhaps further points toward a similarity between Fc^RI 
and Fc^RII interaction sites. In order to assess this possibility, the 
earlier described panel of mutant IgG 3 molecules with point substi- 
tutions in residues 234 to 237 was utilized. The ability of these anti- 
bodies to form rosettes with the Daudi cell line, which expresses 
FcyRII only, was determined and compared to that of wild-type human 
IgG 3 (Lund et al, 1991). Substitutions of Leu - Ala 234, Leu -> Ala 
235, Leu - Glu 235, Gly -> Ala 236, and Gly -» Ala 237 in each case 
reduced the number of rosettes formed with Daudi cells. The most 
marked effect was seen with Leu Ala substitution at residue 234, 
where rosette formation was reduced to <10% of that of the wild-type 
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IgG 3 ..The fact that point substitutions in this lower hinge region obvi- 
ate, at least to some degree, both Fc^RI and Fc^RII binding suggests 
that the receptors recognize overlapping interaction sites. However 
the subtle differences seen in the rosetting profiles of the two receptors 
with the mutant IgG 3 panel indicate that the interaction sites do not 
appear to be strictly identical. Hence, the presence of Leu 234 would 
seem to be most critical for Fc y RII recognition whereas that of Leu 235 
appears to be more important for Fc y RI binding. 

Recently, as mentioned earlier, chimeric IgG/IgE molecules have 
been generated in which the CJL and C y 3 domains are interchanged 
either singly or together, with the C e 3 and C E 4 domains (Shopes et a!., 
1990). The results obtained when these chimeras were assessed for 
ability to mediate rosette formation with the FcyRI", Fc y RII + K562 
cell line may at first appear to be in conflict with the above. Only 
molecules bearing both C y 2 and C y 3 domains were capable of mediat- 
ing rosette formation, suggesting a requirement for both domains in 
the interaction. The C y 3 domain would thus appear to be contributing 
more than merely a stabilizing effect on the structure of the C y 2 do- 
main because substitution with the homologous C € 4 domains in the 
y/C t 4 chimera failed to produce a molecule capable of rosetting. 
However, the contribution of C t 4 to the overall stability of the chimeric 
Fc may be less than that from the native Cy3. The reduced affinity of 
y/CA for Fc y RI would indeed be consistent with this idea. Hence, if 
the y/C e 4 chimera had even a slightly reduced affinity for Fc y RII this 
may take it below the threshold necessary to mediate rosette formation 
in this purely qualitative assay. Therefore, .contrary to first appear- 
ances, the results with these chimeric molecules may still be consis- 
tent with a critical role for the lower hinge region of the C y 2 domain in 
the FcyRII-IgG interaction. 

5. FcyRUI 

The third class of human Fc receptor for IgG, Fc y RIII (CD16), is a 
glycoprotein present on neutrophils, macrophages, cultured mono- 
cytes, and NK cells. Recently, reactivity with anrj-Fc y RIII monoclonal 
antibodies (3G8, CLB Gran 1, and B73.1) has indicated the presence of 
this receptor on kidney mesangial cells and on placental trophoblasts 
(Sedmak et al, 1990, 1991). Fc^RIIl has also been demonstrated on a 
small subpopulation of freshly isolated monocytes (Passlick et ai, 
1989; Anderson et al, 1990b). On sodium dodecyl sulfate (SDS) gels it 
appears as a broad band of 50-80 kDa. 

In humans, Fc y RIII is encoded by two genes, termed Fc y RIII-A 
and FCyRIII-B, each giving rise to a single transcript (Ravetch and 
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Pprussia 1989). Alternatively, the two genes have been termed 
fLrIH-1 and Fc^RIII-2, where III-l is equivalent to III-B and III-2 is 
equivalent to III-A. III-B is expressed exclusively in neutrophils, 
wherein the receptor is anchored to the plasma membrane via a 
phosphatidylinositol-glycan linkage. III-A is expressed in NK cells, 
macrophages, and cultured monocytes and encodes a transmembrane 
molecule with a 25-amino acid cytoplasmic region (Huizinga et al 
1988; Selvaraj et al, 1988, 1989; Scallon et al, 1989). Both forms of 
receptor possess two extracellular immunoglobulin-like domains ig. 
15) The two genes are, in fact, nearly identical, with only nine nucleo- 
tide substitutions. These give rise to six amino acid differences in the 
two proteins, as well as the loss of 21 residues from the C-terminus of 
the III-B receptor. A single amino acid substitution lying at the site of 
phosphatidylinositol-glycan (PIG) linkage accounts for the alternative 
topological forms of Fc 7 RIIL Thus, Ser at position 203 in III-B results 
in a PIG-linked receptor whereas Phe 203 in III-A does not permit this 
mode of anchoring to occur. A transmembrane form of FCyRIII there- 
fore results (Hibbs et al, 1989; Kurosaki and Ravetch, 1989; Lanier et 

al, 1989a). J tit o 

The structural differences between Fc^RIII-A and III-B mamtest 
important functional distinctions. For example, Fc^RHI-A receptors 
on NK cells and macrophages can mediate phagocytosis and ADCC, 
but reports suggest that Fc y RIII-B cannot (Fanger et al, 1989; Selvaraj 
et al, 1989; Anderson et al t 1990a). Fc^RIII-B on neutrophils may 
therefore serve as a highly mobile "trap" for multivalent ligand, allow- 
ing efficient presentation to FcyRIl with subsequent mediation of 
effector functions. However, recent reports suggest that Fc,RIII-B 
may, by as yet undefined means, independently initiate certain func- 
tions (Kimberly et al, 1990). Multivalent ligation of FcyRIII-B, for 
example, results in a [Ca 2+ l r dependent increase in actin filament 
assembly, known to be a prerequisite of phagocytosis and secretion 

(Salmon et al, 1991). 

Two allelic forms of Fc^RIII, termed NA1 and NA2, exist on neutro- 
phils. FcyRIlI on NK cells and macrophages, however, is restricted to 
the NA2 allele (Tetteroo et al, 1988; Huizinga et al, 1989b; Salmon et 
al, 1990; Kindt et al, 1991). Comparison of nucleotide sequences 
coding for the allelic forms revealed minor differences, which result in 
the loss of glycosylate sites in NA1 (Ory et al, 1989; Ravetch and 
Perussia, 1989). 

Fc^RIII-A has been demonstrated to associate with a dimer ol 
another small polypeptide, either the y subunit of the high-affinity 
Fc receptor for IgE (Fc t RI; see later) or the CD3£ chain of the T 
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cell receptor (Lanier et al, 1989b; Ra et ai.,1989a). These smaller 
molecules form part of a family of related proteins that associate as 
disulfide-linked dimers (Orloff et al, 1990). The interaction of 
Fc 7 RIII-A with homo- or heterodimers of y and { appears to be critical 
for optimal expression, as demonstrated by experiments in COS cells 
(Hibbs et al, 1989; Kurosaki and Ravetch, 1989). The sites of interac- 
tion between Fc^RIII-A and y or £ lie in their transmembrane regions 
(Farber and Sears, 1991; Lanier et al, 1991). Recendy, 4 activation" of 
Fc r RlII in NK cells by interaction with complexed IgG has been 
shown to result in phosphorylation of the associated £ chain. Hence, 
the £ chain acts as a transducing element in mediation of responses of 
Fc^RIII-A occupation by complexed IgG (O'Shea et al, 1991; Vivier et 
al, 1991). 

6. lgG-FCyRIII Interaction at the Molecular Level 

The affinity of Fc^RIII-B on neutrophils for monomeric IgG is too 
low to be readily experimentally determined. Studies using IgG di- 
mers of different human subclasses have indicated that Fc^RIII-B 
binds Igd and IgG 3 but not IgG 2 or IgG 4 (Huizinga et al, 1989a). 
Estimates of affinity constants for human IgGi dimers range from 
3.6 x 10 6 M~ l (KurlanderandBatker,1982)toll x 10 7 M^ 1 (Huizinga 
et al, 1989a). The former value may be lower due to the glutaral- 
dehyde treatment used to cross-link IgG molecules in the experiment. 
The authors of the latter investigation highlight the possibility that 
their measurements may be a composite value, with contributions from 
both Fc y RIII and Fc y RII. 

Recently, studies of Fc 7 RIII-A on NK cells and macrophages suggest 
that this form of the receptor may have a higher affinity for human IgG 
than does Fc r RIII-B (Van de Winkel and Anderson, 1991). The anti- 
Fc 7 RIII monoclonal antibody, 3G8, can be displaced from Fc r RIII-A 
by monomeric IgG, but not from Fc y RIII-B (Anderson et al, 1990b), 
Further, different glycoforms of FcrRIII-A may vary in their affinity for 
ligand. The high-mannose-containing FayRIII-A on NK cells appears 
to have a higher affinity for IgG than does FcyRIII-A on monocytes 
because the latter receptor, which lacks high-mannose oligosaccha- 
rides, requires higher concentrations of IgG to displace 3G8 (Kimberly 
et al, 1991). Studies on NK cells show a specificity profile for mouse 
IgG subclasses of IgG 3 > IgCza > IgG 2b » IgCi (Kipps et al, 1985; 
Anasetti et al, 1987). 

FcyRIII is able to interact with certain lectin molecules, probably via 
its high-mannose-type oligosaccharides. Thus, the phagocytosis of 
Con A-treated erythrocytes by neutrophils is inhabitable by aggregated 
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IgG 3G8, and monosaccharides such as r>mannose (Salmon et.al, 
1987; Kimberly et al, 1989). Thus, the oligosaccharide moieties recog- 
nized by Con A have been suggested to contribute to the integrity of 
the IgG-binding site on Fc^RIII. 

Turning to the interaction site on IgG, early experiments indicated 
that the integrity of the Fc region was required for binding to neutro- 
phil Fc^RIII. Hence, neither a pFc' fragment nor a tryptic C^ domain 
fragment could inhibit the interaction (Bamett-Foster et al, 1978). The 
importance of the G^/CV* interface was suggested by the reported 
ability of protein A, which binds in this region, to block the binding of 
IgG to the receptor (Bamett-Foster et al, 1982). However, other 
workers found an anti-IgG monoclonal antibody, which recognizes an 
epitope in the C^/G^ interface region, unable to inhibit the binding 
of IgG to Fc^RIII on monocyte-depleted peripheral blood mononu- 
clear cells (Sarmay et al, 1985). This same study, using a panel of 
anti-IgG monoclonal antibodies and the same effector cells, proposed 
that the FCy receptor involved, assumed to be Fc y RIII, binds to a 
region in the domain of IgG and that a second region in the C 7 2 
domain is critical for triggering of ADCC via this receptor. More re- 
cently, aglycosylation of Fc of human IgG 3 has been reported to render 
it incapable of mediating ADCC via Fc y RIII on K cells (T plus Null 
cells) (Lund et a/.,"1990). Currendy, however, our understanding of 
how IgG and Fc^RlIIl interact at the molecular level seems to require 
further clarification. 

B. Fc e Receptors 

Two classes of receptor specific for the Fc region of IgE, termed 
Fc t RI and Fc t RII, have been described and will be discussed here in 
turn. 

I. Fc«RJ 

Human FcJU is found exclusively on the surface of mast cells and 
basophils, where it binds IgE with high affinity. Upon aggregation of 
Fc e RI-IgE complexes by interaction with multivalent antigen the cell 
degranulates, releasing mediators of the allergic response (Metzger et 
al, 1986; Metzger, 1988). Fc t RI is a multichain glycoprotein consisting 
of one a subunit, one 0 subunit, and two y subunits (Metzger et al, 
1983; Alcarez et al, 1987) (Fig. 17). The a subunit, which contains the 
binding site for IgE, is an integral membrane protein with a single 
membrane-spanning region and two immunoglobulin-like extracellu- 
lar domains. cDNA clones for the human a subunit have been isolated 
(Kochan et al, 1988; Shimizu et al, 1988). The predicted product 
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Fic. 17. Schematic representation of Fc.RI. The receptor is a tetramer of one a sub- 
unit, one subunit, and two y subunits. 



shares a considerable degree of homology to human FcyRIII-A. Be- 
tween species, the most highly conserved region is the transmembrane 
portion, with the cytoplasmic tail showing most divergence (Kinet, 
1989). These observations may suggest that the a subunit intracellular 
region is not involved in a key function, whereas the membrane- 
spanning portion serves some specific role, perhaps involving interac- 
tion with the p or y subunits (Kinet and Metzger, 1990). Indeed, recent 
results from experiments expressing wild-type and mutant forms of the 
rat Fc e RI in COS ceils support these suggestions and will be discussed 
a little later. 

No sequence data is currently available for the human 0 subunit 
However, cDNA clones of both the rat and mouse £ chains have been 
isolated (Kinet et ai, 1988; Ra et al, 1989b). The predicted protein 
products share an extremely high degree of identity (83%), suggesting 
that the human counterpart may also do so. The predicted topology of 
the 0 subunit comprises four membrane-spanning regions with both isl- 
and C-termini in the cytoplasm. 

Both cDNA and genomic clones of the human y subunit have been 
isolated (Kuster et al, 1990). The y subunit appears to be well con- 
served, with 86% identity between the predicted polypeptides of hu- 
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man mouse, and rat y chains. This small third Fc,RI subunit spans the 
membrane once to give a short N-terminal extracellular region of just 5 
amino acids and a longer cytoplasmic tail (42 amino acids) (Kuster et 
al 1990). As mentioned earlier, the y subunit displays homology to the 
r chain of CD3 and both are members of a family of proteins that 
associate as disulfide-linked dimers (Orloff et al, 1990). Recently it has 
been shown that human CD3{ can substitute for the y subunit in 
assembly and functional expression of rat Fc« Rl in a Xenopus oocyte 
expression system (Howard et al, 1990). Similarly, both y and ? dimers 
may associate with human Fc r RIH-A (see earlier). 

A series of experiments in which component subunits of the receptor 
were cotransfected into COS cells to reconstitute rodent, human, or 
chimeric Fc.RI molecules have yielded interesting information on 
requirements for efficient cell surface expression. In both the rat and 
mouse systems, cotransfection of the a, |3, and y subunits are necessary 
for expression of Fc e Rl at the cell surface (Blank et al, 1989; Ra et al, 
1989b). HoweveT, cotransfection of human a and y subunits is suffi- 
cient to give efficient expression of a molecule capable of binding IgE 
(Kuster etal, 1990). Addition of rodent /3 chain did not increase expres- 
sion efficiency. Chimeric receptors of human o subunit plus rat 0 and 
y of either rat or muuse have also been expressed and shown to bind 
IgE with affinity comparable to that of Fc t RI on normal cells (Miller et 

al, 1989; Ra et al, 1989b). 

Site-specific mutagenesis of the a, /3, and y subunits of a rat b c t Ki 
expressed in COS cells revealed that removal of cytoplasmic tails from 
any or all of the subunits had little effect on surface expression. 
However, even minor changes within the transmembrane regions led 
to reduced expression (Varin-Blank and Metzger, 1990). In the rat 
receptor, therefore, the membrane-spanning regions of the subunits 
appear critical for optimal expression. By contrast, cotransfection of 
human o subunit and a truncated rat y subunit lacking a cytoplasmic 
domain resulted in no expression of human a, suggesting that the 
human and rodent receptors may assemble differendy (Varin-Blank 
and Metzger, 1990). Models to describe the molecular interaction be- 
tween transmembrane segments of the subunits are now emerging 
(Varin-Blank and Metzger, 1990; Farber and Sears, 1991). 

2. Fc«R/-/gE Interaction at the Molecular Level 
Human Fc.RI binds human IgE with high affinity (K a ~ 1 * 10 1 " 
M~ l ) and interacts somewhat more weakly with rat and mouse IgE 
(Conrad et al, 1983). A chimeric a subunit consisting of the extracellu- 
lar portion of the human a subunit fused to the transmembrane and 
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cytoplasmic domains of the p55 1L-2 receptor has recently been shown i n | 

to bind IgE with high affinity. Thus, the interaction site on the receptor H< 

appears to lie in the extracellular portion of the. a subunit, with no 0 n 

apparent contribution from the other subunits (Hakimi et al f 1990). tui 

Using the same chimeric receptor, it has been further demonstrated S e: 

that monoclonal antibodies recognizing epitopes in the second domain foi 

of the a subunit (adjacent to the cell membrane) can inhibit the interac- bi) 

tion with IgE (Riske et ai, 1991). This may suggest that IgE also binds 19 

directly to the second a-subunit domain. Alternatively, the inhibiting sit 

antibodies may exert their effect by steric hindrance of a site distal to m« 

their point of recognition or by induction of conformational changes in Ig 

the IgE-binding region. The carbohydrate moieties of the a subunit dc 

appear not to be involved in the IgE interaction because rat Fc e RI on to 
basophilic leukemia cells cultured in the presence of tunicamycin, an 

inhibitor of N-linked glycosylation, are still able to bind IgE (Hemp- m 

stead et of., 1981). gr 

Turning to the site of interaction on IgE, an excellent review of the ac 

various studies attempting to localize the binding site appeared some 3 lo 

years ago (Metzger, 1988). Here, therefore, we will concentrate on si 

advances made since then. The binding site for Fc € RI is known to lie in th 

the Fc portion of IgE, which is composed of the paired C t 2, C e 3, and a ' 

the C e 4 domains (Fig. 8) (Ishizaka and Ishizaka, 1975). The importance kl 

of the C e 2/C £ 3 interface was earlier suggested by the finding that d< 

interaction of rat IgE with rat Fc £ RI protected that region of the anti- c 

body, in particular from tryptic proteolysis (Perez-Montfort and Metz- * 

ger, 1982). Further mapping of the human Fc e RI site has made use of sl 
recombinant peptides of human IgE expressed in Escherichia coli. 

The lack of glycosylation of Fc £ fragments generated in this manner ^ 

does not impede their ability to interact with FcJU (Ishizaka et ai, Ij 
1986). This is consistent with an earlier report that nonglycosylated 

intact rat IgE still bound to rat Fc e RI (Kulczycki and Vallina, 1981). A f( 

monomelic recombinant peptide of 76 amino acids, comprising resi- r< 

dues 301-376 spanning the C e 2/C £ 3 interface, was reported to interact " 

with human Fc e RI with an affinity similar to that of intact IgE (Helm et r ' 

al. } 1988). Use of a series of peptides subsequently demonstrated that c 

residues 363-376 in the above peptide are not essential for Fc £ RI . 

binding (Helm et ai, 1989). Further, an epitope lying within this ! : 

peptide recognized by an anti-IgE monoclonal antibody demonstrates ^ 

a sensitivity to heat and alkylation similar to that displayed by IgE * 
when interacting with Fc e RI (Del Prado et al, 1991). 

A recent report suggests that an octapeptide lying at the C e 3/C e 4 1 

interface within the peptide above (residues 345-352) is capable of r 
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inhibiting histamine release by human basophils (N.o et al, 1» 
However this result should perhaps be questioned, as the octapeptide 
nnlv appears to inhibit at molar concentrations several orders of magm- 
2 a S than those required with intact human myeloma IgE.. A 
reXs of resonance energy transfer studies provide further evidence 
for Ae possible involvement of the C.2/C.3 interface region in Fc.M 
bTnSng (Holowka and Baird, 1983; Holowka et al, 1985; Zheng ei al 
1991) Distances between fluorescent donor probes, placed a specific 

uL on gE and anti-IgE antibodies, and acceptor probes at the ee l 
membrane surface were determined. These measurements indicate 
WE bound to Fc.RI has a bent conformation, with die C.2 and C.3 

doma °ns"ying closest to the cell membrane and therefore presumably 

T^^TlUs have utilized a pane, of chimeric IgE 
molecules to assess the relative contribution of each domain. One 
Zup found that mutant mouse IgE molecules lacking either 45 amino 
Sfrom the carboxy end of C.3 or almost the entire ■ C^»£ 
longer bound to rat Fc.RI (Schwarzbaum et al .1989). Further the 

sites of recognition of two anti-IgE *? h ^ S ^™^T°^ et 
the lgE-Fc.Rl interaction (Baniyash and Eshhar 1984 Ban.yash et 
al, 1988) were localized to the C.3 domain. A third an anybody 
known not to inhibit" receptor binding, was shown to bind the C.4 
domain. These observations were interpreted as indicating that the 
C 3 domain plays a key role in Fc.RI recognition, whereas the C.4 
dfmain although not directly involved in the interaction, serves to 
stabilize the conformation of Fc. necessary for Fc.RI binding. 

A further study made use of the ability of human IgE to . bind to 
human Fc.RI but not to rat Fc.RI, in contrast to the reactivity of mouse 
IgE with both receptors (Nissim et al, 1991). Chimeric human/mouse 
IgE molecules, in which single or multiple mouse domains ^tuted 
for human ones, were assessed for binding to either rat Fc.RI or a 
reconstituted human Fc.RI expressed in COS cells. When the C 2 
domains of human IgE were replaced by those of i • *«• 
resultant molecule bound human but not rat Fc«RI. In contrast, a 
chimeric human IgE containing mouse C.3 domains (CHM3) bound 
both receptors. Furthermore, deletion of C.2 from CHM3 Produced no 
impairment of binding to rat Fc.RI. Again, these results suggest that 
the C.3 domain is the principal region involved in interaction with 

Fc.RI (Nissim et al, 1991). ... . F 

A second group of investigators has generated chimeric mou e lgb 
in which on? or more IgE domains are substituted by homologous 
regions from human IgG, (Weetall et al, 1990). An IgE molecule ,n 
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which the C e 4 domains were replaced by IgG Cy3 domains bound rat 
Fc € RI with affinity comparable to that of wild-type mouse IgE. All 
other chimeric molecules, tested did not contain both C t 2 and C c 3 and 
were unable to bind rat Fc € RI. The favored interpretation was that both 
C c 2 and C € 3 domains are necessary for the binding interaction (Weetall 
et ai, 1990). However, the observations mentioned earlier (Nissim et 
al, 1991) suggest that the role of C e 2 here may be in stabilizing the 
conformation of C c 3 necessary for optimal Fc e RI interaction. A substi- 
tuted human IgGi hinge may not be able to perform this role ade- 
quately. This possibility may explain the inability of a chimeric human 
IgG molecule, in which mouse C e 3 domains substitute for C y 2 do- 
mains, to interact with rat Fc £ RI (Weetall et al., 1990). The current 
weight of evidence, therefore, would seem to support the idea that the 
recognition site for Fc e RI lies (1) within the region encoded by the C £ 3 
exon and (2) within the peptide Gin 301-Leu 363. The region of Fc« 
fulfilling both these criteria lies between Asp 330 and Leu 363. Of 
particular pertinence is the emergence of a new molecular model for 
the Fc region of IgE (Helm et al., 1991). This revised model incorpo- 
rates the finding that the inter-C € 2 disulfide bonds involving Cys 238 
and Cys 241 are parallel, rather than crossed as in the earlier models 
(Padlan and Davies, 1986; Pumphrey, 1986). The most pronounced 
consequence of modeling parallel disulfide bridges is the appearance 
of an exposed segment of approximately 2.4 nm in length, comprising 
residues 329-335 lying between C<2 and C e 3 (Fig. 3). This region may 
constitute the structural equivalent of the lower hinge region in IgG 
and hence it is tempting to speculate that the Fc c RI interaction site 
may lie here. Should further mutagenesis experiments verify this pos- 
sibility, a common theme of Ig-like Fc receptor domains interacting 
with a flexible lower hinge region (or its equivalent) in immunoglobu- 
lins may emerge. 

3. FctRII 

The second class of receptor for the Fc region of IgE (Fc«RII or 
CD23) has lower affinity for its ligand than does Fc t RI and hence is 
sometimes referred to as the low-affinity receptor. Fc c RII is present on 
inflammatory cells, including monocytes, eosinophils, and platelets, 
and on B lymphocytes. cDNA clones coding for human FcJUI have 
been isolated and shown to bind IgE when expressed in mammalian 
cell systems (Kikutani et ai, 1986; Ikuta et ai, 1987; Liidin et ai, 
1987). Unlike all other leukocyte Fc receptors described here, Fc e RII 
is not related to the immunoglobulin gene family of proteins. Rather it 
displays homology to a family of animal lectins, which includes the 
human and rat asialoglycoprotein recepjtors. 
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On SDS gels human Fc.RII has a molecular weight of about 43,000 
and is composed of a 321-amino acid polypeptide core of -36,000 
molecular weight and both O- and N-linked oligosaccharides (Dele- 
soesse et al. 1989). The receptor spans the membrane once in a rather 
unusual orientation, because the short 23-amino acid N-terminal do- 
main lies inside the cell and the much longer C-terminal region is 
found to the exterior. The domain exhibiting homology to animal lec- 
tins spans about 120 residues of the extracellular portion and includes 
three cysteine pairs. Close to the C-terminus lies an Arg-Gly-Asp 
(RGD) sequence in reverse, i.e., DGR (Kikutani et al, 1986). A number 
of molecules that bind to the integrin family of receptors contain this 
RGD motif, suggesting that FceRII may be able to interact with adhe- 
sion molecules (Gordon et al, 1989). 

More recently, a second species of human FcJRII, termed Fc«Kilb, 
has been identified and differs from the earlier described receptor 
(Fc Rlla) only in the first few amino acids at the N-terminus (Yokota et 
al *1988). mRNA for Fc.RIIa is constitutively expressed in B cells 
alone. In contrast, Fc t RIIb is expressed in monocytes, eosinophils, and 
B cells only after stimulation with IL-4 (Yokota et al, 1988). Function- 
ally Fc Rlla appears to be involved in the regulation of B cell develop- 
ment (Gordon et al, 1989), whereas Fc £ RIIb plays a role in IgE- 
dependent cytotoxity against parasites such as schistosomes (Capron 
and Dessaint, 1985). 

4. Fc t RII-lgE Interaction at the Molecular Level 
Monomelic human IgE binds to FcJUIb with a K a of about 3 x 
10 7 AT 1 (Anderson and Spiegelberg, 1981; Joseph et al, 1986). Di- 
meric IgE may have a slightly higher affinity for the receptor than 
monomers, at least in the rat system (Finbloom and Metzger 1982). 
The natural occurrence of soluble proteolytic cleavage products oi 
Fc.RH, termed sFc.RII, soluble CD23, or IgE-BF, which are capable 
of binding IgE, indicates that the site of interaction lies in the C- 
terminal part of the extracellular portion (Letellier et al, 1989). In- 
deed, expression of recombinant soluble Fc e RII has localized the 
binding site within a 172-amino acid stretch at the C-terminus 
(Uchibayashi et al, 1989). This C-terminal portion also includes the 
region of homology with animal lectins. Despite the implication that 
Fc.RII may therefore interact with the oligosaccharide chains of IgE, 
this is not the case. Certain recombinant human e-chain fragments 
synthesized in E. coli and therefore devoid of carbohydrate are still 
able to bind to human Fc«RII (Vercelli et al, 1989). Further, enzymab- 
cally deglycosylated myeloma IgE interacts with the receptor In fact, 
it appears to bind slighdy more tighdy to the receptor than does the 
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parent IgE. Moreover, high concentrations of mono- and disaccharides 
do not inhibit the interaction between mouse IgE and Fc e RII. The 
interaction does, however, share a dependence on calcium and pH 
with other lectin proteins (Richards and Katz, 1990). 

Use of recombinant Fc c fragments mentioned above has allowed 
further localization of the Fc e RII interaction site on IgE (Vercelli etal., 
1989). A fragment (rE2-4) corresponding to the whole of the Fc[ 
(paired C e 2, C e 3, and C t 4 domains) bound to Fc e RII on a human B cell 
line with comparable affinity to intact IgE. A shorter fragment, com- 
prising paired C t 3 and C € 4 domains, also bound but with much lower 
affinity. An intermediate peptide (rE2'-4) with the 30 C-terminal resi- 
dues of C e 2 plus C e 3 and C e 4 was almost as active as the whole Fc e . In 
contrast, removal of C e 4 (leaving C € 2 plus C € 3) generated an inactive 
peptide. Thus, receptor recognition appears to require the presence of 
all three domains. The C c 4 domain, however, may be substituted for by 
the C y 3 domain of mouse IgC 2b without considerable loss of reactivity 
with F c € RII. The indirect role of the C e 4 domain would thus appear to 
be to promote the dimerization of the two e chains, necessary for 
receptor binding. This is borne out by the demonstration that 
replacement of Phe 506, lying at the interface between the paired C t 4 
domains, with Arg generates a monomeric form of the Fc € chain that is 
unable to bind to Fc.RII (Vercelli et ai, 1989). Unlike Fc t RI, binding 
of Fc e RII therefore has a clear requirement for both heavy chains in 
Fc e . One possibility is that two Fc e RII molecules may interact simulta- 
neously with one IgE molecule. The observation thatdimers of mouse 
Fc £ RII may preexist adds weight to this idea (Peterson and Conrad 
1985). 

Anti-IgE monoclonal antibodies against epitopes lying between res- 
idues 307-315 in C € 2 and residues 367-370 in C«3 exhibit marked 
abilities to inhibit the IgE-Fc e RII interaction (Chretien et ai, 1988). 
These two regions may lie close to one another in three-dimensional 
space (Fig. 3). An attractive hypothesis, incorporating the findings with 
recombinant peptides and monoclonal antibodies alike, is that the 
Fc«RII site may lie close to residues 367-370 in the C t 3 domain (Ver- 
celli et al, 1989). The anrj-C e 2 antibodies might then exert their effect 
by steric hindrance. 

C. Fc a Receptors 

The presence of receptors for IgA has been reported on human 
monocytes, macrophages, and neutrophils (Fanger et al, 1980; Malis- 
zewski et al., 1985; Chevailler et al, 1989), T cells (Briere et al, 1988; 
Millet et al, 1988), B cells (Gupta et al, 1979; Millet et al, 1989) 
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eosinophils (Abu-Ghazaleh et of., .1989), and NK cells (Kimata and 
Saxon 1988). Purification of the receptor from neutrophils and mono- 
cytes has revealed a heavily glycosylated protein of about 60 kDa 
(Albrechtsen et al, 1988; Monteiro et al, 1990). 

Fc receptors on monocytes and neutrophils are capable ol mediat- 
ing phagocytosis of IgA-coated target cells (Fanger et al, 1983; Gorter 
et al, 1987; Yeaman and Kerr, 1987). They may also serve to promote 
ADCC by synergism with Fc 7 receptors (Shen and Fanger, 1981).. 
Interaction of IgA, aggregated either artificially or at a cell surface, 
with monocyte and polymorphonuclear (PMN) Fc a receptors can trig- 
ger both the release of inflammatory mediators such as leukotrienes 
and prostaglandins and the generation of superoxide (Ferreri et aZ 
1986; Gorter et al, 1987; Stewart and Kerr, 1990; Padeh et al, 1991), 
Cross-linking of Fc a receptors in this way may also result in neutrophil 
degranulation (Albrechtsen et al, 1988). 

Recently a cDNA clone coding for a human Fc a receptor has been 
isolated from a monocyte-like cell line cDNA library (Maliszewski et 
al, 1990). COS cells transfected with the cDNA clone readily bind 
IgA-coated erythrocytes. The deduced amino acid sequence indicates 
an integral membrane protein with a peptide core of about 30 kDa. The 
remainder of the mass is contributed by carbohydrate moieties at- 
tached at up to six potential extracellular N-glycosylation sites and 
perhaps further O-glycosylation sites. The N-terminal 206 amino acids, 
lying outside the cell, comprise two immunoglobulin-like domains 
that display homology to the extracellular regions of Fc,RI, Fc 7 RII, 
FCyRIII, and the a-subunit of FcJRI. A single transmembrane segment 
of 19 hydrophobic amino acids is then followed by a C-terminal cy- 
toplasmic domain of 41 amino acids. Northern blot analysis revealed 
that mRNA coding for the Fc a receptor was present in peripheral blood 
monocytes and neutrophils. No message was detected in tonsillar B or 
T cells, suggesting either that the IgA receptor reported in these cell 
types is structurally distinct, or that receptor expression was not in- 
duced under the particular conditions used. 

f Ca R-/gA Interaction at the Molecular Level 

The human monocyte/PMN Fc a receptor appears to bind human 
serum IgA a and IgA 2 with similar affinity (Chevailler et al, 1989; 
Stewart and Kerr, 1990). Estimates using solubilized receptor indicate 
that both subclasses give half-maximal inhibition of the receptor-IgA 
interaction at concentrations of 4.8 x 10" 7 M, suggesting that the af- 
finity constant lies around 5 x 10 7 M~ l (Mazengera and Kerr, 1990). 
Human secretory IgA of both subclasses also exhibits very similar 
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inhibitory abilities. Thus, the receptor interacts equally well' with 
monomeric serum IgA and the dimeric IgA, in complex with J chain 
and secretory component, of secretory IgA. The site of interaction lies 
entirely in the Fc region of IgA, because Fc a can inhibit the binding of 
polymeric IgA to the receptor, as shown in an indirect immunofluo- 
rescence assay (Monteiro et al., 1990). Further, the rebinding of pu- 
rified receptor to IgA-Sepharose beads is inhibited by Fc a (Mazengera 
and Kerr, 1990). The above results with secretory IgA therefore sug- 
gest that the presence of either J chain or secretory component does not 
impede interaction of Fc Q R with the Fc region of secretory IgA. 

A pepsin digestion product of IgA, lacking the C a 3 domains, retains a 
somewhat reduced ability to interact with the neutrophil Fc Q receptor 
(Mazengera and Kerr, 1990). This may suggest that the presence of 
both the C a 2 and C a 3 domains is necessary for full reactivity with the 
receptor. It is possible, however, that the.C a 3 domain serves the indi- 
rect role of maintaining the conformation of C a 2 as we have seen with 
homologous Ig domains in the Fc y RI-IgG and Fc € RI-IgE interactions 
earlier. Site-directed mutagenesis experiments on human IgA should, 
in the future, help to elucidate the precise molecular requirements for 
binding to Fc a /eceptors (Woof et al., 1992). 

D. Fc M Receptors 

Subpopulations of human and mice B and T lymphocytes are re- 
ported to express functional Fc M receptors (Moretta et al., 1975; Fer- 
ranini et al., 1977; Mathur et al., 1988a,b). Biochemical analysis has 
revealed an IgM-binding protein of about 60 kDa on activated human 
B cells (Sanders et al., 1987). This molecule was not, however, de- 
tected on T cells, monocytes, or granulocytes. More recently, binding 
inhibition experiments have confirmed that this protein binds to the Fc 
portion of IgM and hence is a true Fc^ receptor (Ohno et aL, 1990). It 
appears to be expressed throughout the various stages of B cell differ- 
entiation and, in this respect, differs from the Fc M receptor on murine B 
cells (Mathur et al., 1988b). The human receptor is anchored to the 
membrane via a phosphatidylinositol-glycan linkage and possesses 
O-linked but not N-linked oligosaccharides. 

Human B cell Fc M receptor interacts with human and mouse IgM 
and, as mentioned previously, Fcg M fragments generated by hot trypsin 
digestion (Ohno et aL, 1990). As the major trypsin cleavage site lies in 
the C M 2 domain, the Fc 5m fragments consist primarily of paired C u 3 and 
C M 4 domains. The same report details experiments using mouse IgM 
domain deletion mutants, which help to further localize the Fc M R 
binding site on IgM. Loss of the C M 1 domain did not impair binding to 
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the receptor. Deletion of both domains and C M 2 resulted in re- 
duced but still significant binding. However, a mutant lacking domains 
C 1, C M 2, and C^3 and another lacking C M 4 no longer bound Fc M R. 
Hencef Fc M R seems to require the presence of both C^3 and C M 4 
domains for recognition. Further study will be necessary to determine 
whether each domain contributes direcdy or indirectly to binding. It is 
perhaps of interest to note that the C M 3 domain of mouse IgM appears 
to play the major role in binding to Fc M R on murine T and B cells 
(Mathur et al, 1988a,b). 

E. Fc 6 Receptors 

Receptors for human IgD have been detected on human B and T 
cells (Sjoberg, 1980; Rudders and Andersen, 1982; Tamma and Coico, 
1991). The receptors on B cells, at least, interact with the Fc portion of 
human IgD. No significant degree of information on the molecular 
basis of the interaction, in the human system, is currently available. 

F. Effector Cell-Target Cell Interaction Mediated by 
Leukocyte Fc Receptors 

We will now attempt to consider the interaction of antibodies and Fc 
receptors in a more physiological situation. First, we will discuss how 
antibody array formation at cell surfaces may facilitate interaction with 
Fc receptors. Second, we will consider the multiple factors that influ- 
ence the "linkage" of effector cell and target cell by antibody mole- 
cules. 

The possibility of antibody array formation on target cell surfaces 
would appear an attractive hypothesis. Earlier, we mentioned the 
potential interaction of arrays of antibody molecules, possibly sta- 
bilized by Fc-Fc interactions, with Clq. Arrays of dislocated IgG 
molecules, for example, might also be expected to interact advan- 
tageously with Fc receptors for two main reasons. First, multiple Fc 
receptors could bind simultaneously to an array of antibody molecules. 
The receptors would thus be effectively cross-linked, constituting a 
trigger for subsequent effector function. Second, dislocation of IgG 
molecules would serve to maximize access to the Fc^R interaction 
sites(s) lying in the lower hinge region by rotation of Fc regions per- 
pendicular to Fab arms (Burton, 1986). Similar arguments may also 
apply to IgE. Indeed, experimental evidence suggests that IgE bound 
to Fc«RI has a bent conformation in which the C«2/C t 3 interface, 
interacting with the receptor, lies about 45 A away from the cell sur- 
face. The remainder of the Fc lies about 55 A away from the membrane 
whereas the tips of the Fab arms extend some 100 A away from the cell 
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surface (Holowka and Baird, 1983; Holowka et al. t 1985; Zheng et a/., 
1991). Further experimental evidence for antibody arrays was dis- 
cussed earlier. 

We will now turn to the factors influencing recognition of antibody- 
coated target cells by FcR-bearing effector cells. Because many FcR+ 
cells express more than one class of receptor, and an opsonized target 
cell may be coated with multiple antibody isotypes, the in vivo situa- 
tion is presumably rather complex. It seems likely, for example, that 
different types of Fc receptor may act synergistically to trigger effector 
mechanisms (Crockett-Torabi and Fantone, 1990; Kimberly et al., 
1990; Koolwijk et qL, 1991). In order to simplify matters here we will 
discuss a model system involving the interaction of erythrocyte targets, 
coated with a single antibody isotype, with Fey receptor-bearing effec- 
tor cells. This simplification allows assessment of the cell-cell interac- 
tion by rosette formation, the microscopically visible binding of sev- 
eral erythrocytes to an effector cell. 

The interaction of the opsonized target with the effector cell in the 
rosette, in energetic terms, can be seen as the result of two opposing 
contributions, described by the following equation (Walker et al. t 
1989b): 

AG(rosette formation) = AG(Ab-FcR)interaction 

+ AG(nonspecific cell-cell interaction) 

The first term [AG(Ab-FcR)interaction], promoting rosette formation, 
is associated with the free energy of occupation of Fc receptors by 
antibody molecules. The second opposing term [AG(nonspecific cell- 
cell interaction)], militating against rosette formation, is associated 
with the repulsive forces of bringing together two cell surfaces close 
enough to allow bridging by antibody molecules. Hence, a sufficient 
input of free energy from antibody-receptor interactions to overcome 
cell-cell repulsion is necessary to allow rosette formation (see experi- 
mental examples in Fig. 18). These two terms appear to depend on a 
number of factors, each of which may influence rosette formation. 
Thus, the rosette-promoting term is governed by both the antibody 
isotype and the Fc receptor involved and by the number of antibody- 
receptor interactions. The opposing term depends on the net surface 
charge of the two cells and the geometries of the antigen epitope and 
the Fc receptor. Both antigen and receptor must be "accessible'* and 
suitably orientated to give optimal interaction with the antibody mole- 
cule. Finally, the relative structure of the antibody is important. For 
example, human IgGa molecules, with their extended hinge regions, 




ENNY M. WOOF 



iwka et al, 1985; Zhengef al, 
for antibody arrays was dis- 

icing recognition of antibody- 
tor cells. Because many FcR + 
;ptor, and an opsonized target 
dy isotypes, the in vivo situa- 
jems likely, for example, that 
lergistically to trigger effector 
tone, 1990; Kimberly et al, 
simplify matters here we will 
traction of erythrocyte targets, 
th Fey receptor-bearing effec- 
sment of the cell-cell interac- 
lically visible binding of sev- 



BBOHELIR TBUTED 
EFFECTOB CELL 



EFFtCTOR CtU 



Hp 



;t with the effector cell in the 
as the result of two opposing 
ing equation (Walker et al, 



:eraction 

specific cell-cell interaction) 

promoting rosette formation, 
:cupation of Fc receptors by 
ig term [AG(nonspecific cell- 
ette formation, is associated 
;ether two cell surfaces close 
lolecules. Hence, a sufficient 
ptor interactions to overcome 
rosette formation (see experi- 
terms appear to depend on a 
influence rosette formation, 
'erned by both the antibody 
I by the number of antibody- 
i depends on the net surface 
*s of the antigen epitope and 
:or must be "accessible 0 and 
:tion with the antibody mole- 
e antibody is important. For 
leir extended hinge regions, 



Fic 18 Schematic representation of effector cell-target cell interactions. Human 
lgG 3 antirhesus D or anti-NIP antibodies (open Y shapes) and mouse IgGi antiglycopho- 
rin A antibodies (striped Y shapes) interact with their respective antigens on the surface 
of a human rhesus D+ erythrocyte. Under favorable conditions, these antibodies may 
also interact simultaneously with the IgC-binding sites (dark areas) on Fc T receptors on a 
neighboring effector cell. The antibodies then serve to "bridge" between the two eel 
types a situation that, under the microscope, would appear as a rosette, with several 
erythrocytes bound per effector cell. Five different combinations of antigen, antibody, 
and Fa, receptor are shown here to illustrate the theoretical considerations discussed in 
the text. In the first case, on the left, a human lgC 3 anti-D antibody bridges between 
FaJU and the somewhat buried D antigen in the erythrocyte membrane. Thus, this 
illustrates the fact that an effector cell bearing Fc T RI (e.g., U937) can form rosettes with 
lgC 3 anti-D-sensitized erythrocytes. In the second case, however, human IgC 3 is unable 
to bridge between the relatively inaccessible D antigen and Fc^RH. This represents the 
finding that an effector cell bearing Fc,RH alone (e.g., Daudi) cannot form rosettes with 
IgC 3 anti-D-sensitized erythrocytes. In the third combination, the antigen, NIP, is at a 
more accessible location on the red blood cell surface. This allows the IgC 3 anti-NIP 
antibody to bind both to NIP and to Fc Y RIl. Hence, Daudi cells are able to form rosettes 
with NIP-derivatized erythrocytes coated with IgC 3 anti-NIP antibodies. In the fourth 
example, as in the second, the antibody, in this case a mouse IgG t molecule, is unable to 
bridge between the antigenic epitope on the red cell and Fc^RII on the effector cell. 
Hence, no rosettes are seen with this combination. However, when the Fc^RH* effector 
cell is pretreated with the enzyme bromelin, as in the fifth example, this same combina- 
tion of antigen, antibody, and Fc receptor may simultaneously interact. Bromelin treat- 
ment results in cleavage of surface glycoproteins, thereby reducing the net surface 
charge of the cell. The two cell surfaces may therefore approach each other more closely 
as mutual repulsion is reduced. Thus, Daudi cells, after bromelin treatment, become 
able to form rosettes with erythrocytes coated with the mouse IgCj antiglycophonn A 
antibody. 

appear to mediate rosette formation more readily than IgGi molecules 
do, when all other parameters are the same (Walker et al, 1988, 1989b). 
However, this interaction process is merely the first stage of target cell 
destruction and some evidence suggests that optimal interaction may 
not always lead to optimal destruction. Hence, although human IgG 3 
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molecules more readily promote target cell-effector cell interaction 
than do human Igd molecules, it is the latter antibody that mediates 
most effective ADCC against target cells once interaction has occurred 
(Briiggemann et al, 1987). 

In terms of the "design*' of antibodies for therapy, a consideration of 
the multiple parameters influencing interaction of target cell and effec- 
tor cell would seem pertinent. Thus, in order to generate a monoclonal 
antibody effective at Fc receptor-mediated target-cell killing, we must 
take into account not only the more obvious factors of antibody isotype, 
class of Fc receptor, and type of effector cell(s) involved, but also 
antigen density, epitope accessibility, and surface charge on the target 
cell. We must ensure that we have an antibody capable of both specifi- 
cally recognizing a target cell, and of "bridging" efficiently between 
that cell and an effector cell, prior to triggering target cell destruction. 



V. Catabolism of Antibodies 

Although not an effector function, the survival of antibodies in blood 
has important consequences in understanding the biochemistry of 
these molecules and for their potential therapeutic and diagnostic 
uses. The subject has recently been reviewed by Zuckier et al (1989) 
and we shall concentrate on points relating to molecular aspects. 

In humans, antibodies of the IgG class have the longest half-lives of 
any of the serum proteins (average t U2 = 21 days), with the other 
classes having shorter half-lives: IgA, 6 days; IgM, 5 days; IgD and 
IgE, 3 days. IgG 3 demonstrates a significantly shorter r 1/2 (7days) than 
the other IgG subclasses whereas the two IgA subclasses have similar 
ti/2- The unique feature of IgG catabolism in humans and in other 
species studied is that the catabolic rate is proportional to the serum 
concentration. Thus serum IgG is degraded much more rapidly in 
hypergammaglobulinemic individuals and much more slowly in hypo- 
gammaglobulinemics. This has led Brambell et al (1964) to propose 
that there are a limited number of receptors that complex IgG and 
protect it from degradation in blood. When levels of IgG are high, the 
receptors will be saturated, making more IgG available for degrada- 
tion. Conversely, when levels are low, most of the IgG will be pro- 
tected, prolonging serum survival. There is no direct evidence for this 
mechanism but it has been considered the best available (Waldmann 
and Strober, 1969). 

An alternative occurring to us is one based on an equilibrium be- 
tween monomeric and associated IgG-species. Higher serum concen- 
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trations of IgG would favor aggregate formation that 
lized more rapidly than monomer. Such an aggregated population 
might form only a small proportion of the total IgG, and therefore be 
difficult to observe, but be catabolized very rapidly. The site(s) regulat- 
ing the catabolism of IgG (whatever the molecular mechanism in- 
volved) is believed to be in the Fc part of the molecule because Fc 
fragments are catabolized at the slow rates characteristic of IgG, in 
contrast to Fab fragments, which are rapidly cleared (Spiegelberg and 
Weigle, 1965a,b; Wochner et al, 1967; Zuckier et al, 1989). Further- 
more, infusion of Fc fragments can mimic IgG in accelerating the 
catabolism of circulating IgG in mice (Fahey and Robinson, 1963). 
Interestingly, the Fc fragments of all four human IgG subclasses ap- 
pear to have identical fractional catabolic rates, implying that struc- 
tures outside the Fc region are responsible for the accelerated catabo- 
lism of IgG 3 (Spiegelberg and Fishkin, 1972). 

A number of studies have sought to localize the site more precisely. 
Yasmeen et al (1976) reported that a Cy2 fragment from a human IgGi 
protein was cleared from rabbit circulation with a tyz similar to intact 
IgG and Fc and much longer than that for Fab, pFc', or C y 3 fragments. 
Arend and Webster (1977) reported that rat pFc' was rapidly catabo- 
lized in rats compared to Fc. These studies imply a crucial role for the 
domain. In contrast, Pollock et al (1990), using mutant mouse 
IgG molecules, have obtained evidence that deletion of any of the con- 
stant domains has an effect on clearance in the mouse. Further, using 
IgG2b/2a hybrid molecules, they suggest that sequences at the C- 
terminal end of C y 2 or within the domain, or conformations con- 
trolled by these sequences, are important in catabolism. 

Recently Wawrzyncak et al (1992a) looked at the rates of clearance, 
from the circulation of mice, of mutant mouse IgG 2 bS used in the 
Clq-binding and Fc receptor studies described earlier. They found no 
significant differences between t U 2 values for mutant and wild-type 
IgGs, implying that clearance is independent of the ability to bind Clq 
or mouse FcRI (mutation of Glu to Leu at position 235 was shown to 
generate IgG 2 b binding to mouse as well as human FcRI). 

Though there has been debate about the role of carbohydrate, the 
present consensus seems to imply that the carbohydrate moieties have 
only a limited effect on serum half-life (Waldmann and Strober, 1969; 
Tao and Morrison, 1989; Zuckier et al 1989; Wawrzynak et al 1992b). 
However, the terminal galactose residues on IgA molecules, and in 
some cases on other Ig classes, do target them to the hepatic galactose 
receptor and thus can play a decisive role in their catabolism (Zuckier 
« <*l, 1989). 
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VI. Bacterial Fc Receptors 

Certain bacteria express on their surface proteins capable of binding 
specially to the Fc region of mammalian immunoglobulins. These 
so-called bacterial Fc receptors have been demonstrated on manv 
staphylococcal and streptococcal strains (Boyle and Reis, 1987) Those 
receptors specific for the Fc of IgG have currently been classified 
according to their functional reactivity with different species and sub- 
classes of IgG, into six groups or types. The nature of the interaction of 
each of type with human IgG will be dealt with in turn. Relatively little 
is currently understood, in precise molecular terms, about the interac- 
tion of bacterial Fc receptors with other classes of immunoglobulin 
These types of receptors will not, therefore, be discussed here. 

A. Type I Fc Receptor 

Type I Fc receptor, frequently termed protein A, is found on the 
surface of the majority of strains of Staphylococcus aureus (Forseren 
and Sjoquist, 1966). This very extended protein has been cloned and 
found to consist of five homology units, each capable of binding the Fc 
ot IgG and a sixth region that does not bind Fc but binds to cell walls 
(Moks et al 1986). Active 7-kDa fragments, each corresponding to a 
homology domain, can be generated by trypsin digestion. It is possible 
for two such fragments to bind simultaneously to one Fc molecule. A 
typical binding affinity (K a ~ 3 x 10 s M" 1 ) is demonstrated by frag- 
ment B binding to rabbit Fc (Lancet et al, 1978). 

Earlier crystallography data (Deisenhofer et al, 1976 1978) have 
been refined to produce a model of the complex of human Fc and 
fragment B at 2.8 A resolution (Deisenhofer, 1981). Fragment B forms 
two contacts with Fc molecules in the crystals but one of these is 
argued to be merely a crystal contact. In the other, thought to exist in 
solution, fragment B binds at the interface between the C v 2 and C 3 
domains of IgG (Fig. 19). The residues involved comprise parts of two 
hydrophobic patches on Fc, one on the C y 2 domain (Met 252 He 253 
Ser 254, Leu 309, His 310, and Glu 311) and the other on* the 03 
domain (His 433, His 435, Tyr 436, and Asn 434). 

Protein A binds the human subclasses IgG,, IgC 2 , and IgG 4 , and 
also IgG 3 molecules of the allotype IgG 3 m(15, 16), characteristic of 
mongoloid populations (Recht et al, 1981). Each of these proteins has 
a histidine residue at position 435 involved in the protein A interac- 
tion. By contrast, in IgG 3 molecules of the Caucasian, allotypes 
IgG 3 m(5) and IgG 3 m(21), this histidine is replaced by arginine. Model 
building (Deisenhofer, 1981) reveals that the inability of such proteins 
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Dwek, 1983; Nose and Wigzell, 1983). It has been demonstrated re- 
cently, by use of l H NMR, that aglycosylation results in only a small 
and localized structural change in the vicinity of the His 268 reporter 
group at the N-terminal end of the C y 2 domain (Lund et al., 1990). 
Hence it is hardly surprising that loss of the carbohydrate residues 
from Fc does not affect the interaction with protein A. Similarly, nei- 
ther reduction and alleviation nor hinge deletion of human Igd per- 
turbs the Gy2-Gy3 interface region, as demonstrated by the continued • 
reactivity of these modified IgCs with protein A. 

B. Type II Fc Receptor 

This second class of receptor is associated with certain strains of 
group A streptococci. There is, however, a considerable degree of 
heterogeneity in the IgG subclass binding profiles among different 
group A isolates. Further, the binding profile of a particular isolate may 
vary with passage and even individual colonies within that isolate may 
express a variety of IgG-binding capacities (Yarnall et al., 1984). Thus 
binding of IgG to these bacteria is complex and appears to be a func- 
tion of several different type II Fc receptors. A recent study has de- 
fined five subtypes of the receptor based on their IgG species and 
subclass specificities (Raeder et al., 1991a). 

Type Ha receptor binds human IgGi, IgG2, and IgG 4 , but not IgG3. 
It also binds rabbit, pig, and horse IgGs. Immunoblotting techniques 
with the strain 64/14 have demonstrated a molecule of —50 kDa with 
these binding properties (Yarnall and Boyle, 1986a). More recently, an 
IgG-binding protein, cloned from the group A strain CS 1 10 (Heath and 
Cleary, 1987, 1989), displayed the IgG-binding profile of type Ha 
receptor (Cleary and Heath, 1990). A study assessing the ability of 
various dipeptides to inhibit the binding of radiolabeled IgG to strain 
64/14 provided information on the localization of the type Ha receptor 
site on IgG (Yarnall and Boyle, 1986b). The dipeptides glycyltyrosine 
and glycylhistidine the binding of human .IgGi, Ig^, an( * *&G4 anc * 
rabbit and pig IgG to 64/14. This result, taken with the human IgG 
subclass specificity profile of this receptor, suggests that the IgG- 
binding site for the type Na receptor may be very similar to that of the 
type I receptor. Histidine and tyrosine residues would appear to be 
important in the interaction. Interestingly, an IgG3 molecule capable 
of binding protein A, and therefore presumably of an allotype with 
histidine at position 435, was found to bind the type Ha receptor on 
64/14 (Yamall and Boyle, 1986a). Reactivity of only certain IgG3 allo- 
types with the cloned receptor was also noted (Cleary and Heath, 
1990). 
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The tvDe lib Fc receptor has been demonstrated on the group A 
J,ins 64/14 A992S, and 11434 and substrain A928 A2 (Raeder et al, 
1991S) with a molecular weight of -35,000 in the two former strains 
Z<\ -47 000 in the two latter. It binds solely to human IgG 3 (Raeder et 
al 1991a- Yarnall and Boyle, 1986a). The largest differences between 
I<rC-,and the other human subclasses reside in its very extended hinge 
reeion suggesting that a potential interaction site for type lib receptor 
may He in this part of the Fc. Attempts to inhibit the interaction with 
monoclonal antibodies specific for the hinge of human IgC 3 (Lowe et 
al 1982) might prove informative. 

Type lie receptor, a 116-kDa protein demonstrated on the strain 
A992S has been defined as displaying specificity for human IgGi and 
rabbit ' pig, and horse IgCs, and weak reactivity with human IgG 4 . 
Sequence comparison of the C r 2 and domains between the human 
IgG subclasses reveals that all residues conserved in both human IgOi 
and IgC 4 are also conserved in IgG 2 and IgG 3 . Thus the specificity ot 
type lie receptor is difficult to explain by sequence differences in the 
ligands Rather, the idea that distinct IgG-binding reactivities may be 
the sum of a number of independent binding units coming together in 
different combinations (Raeder et al., 1991a) may be more appropriate. 

Type Ilo receptor, present on the substrain A928 Al, binds all hu- 
man IgG subclasses as well as rabbit, pig, and horse IgCs (Raeder et 
al., 1991a). This specificity is associated with a 47-kDa proteuv The 
Bfth receptor subgroup, type II'o, is a variant of the above and binds 
only human IgG (all four subclasses) and rabbit IgG. A protein display- 
ing this specificity has been cloned from strain API and given the 
alternative name of protein H (Akesson et al, 1990; Gomi et al, 1990). 
It has a molecular weight of -40,000 and shows homology to protein 
Arp, an IgA-binding streptococcal protein, but not to protein A, 
protein G (see later), or type Ha Fc receptor. Protein H is reported to 
block the binding of both protein A and protein G to IgG, suggesting it 
may also interact with the C y 2/Cy3 interface (Akesson et al, 1990). 

C. Type III Fc Receptor 

Type III Fc receptor is expressed on the surface of most human 
group C and group G streptococci. This receptor, more frequently 
termed protein G, exhibits a remarkably wide reactivity, interacting 
with all human IgG subclasses and IgG from rabbit, goat, cow, sheep, 
rat, mouse, guinea pig, horse, and pig. It displays a considerably higher 
affinity (K a ~ 1 x 10 9 M" 1 for human IgG subclasses) than does protein 
A (Reis et al, 1984; Akerstrom and Bjorck, 1986). Type III receptor is 
reported to have some affinity for Fab fragments of IgG mediated at a 
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site distinct from that binding to Fc (Erntell et al, 1988). In addition 
the receptor has affinity for human serum albumin. 

The type III receptor has been successfully cloned from two differ- 
ent group G sources (Fahnestock et al, 1986; Guss et al, 1986; Olsson 
et al, 1987). The deduced amino acid sequence shares features in 
common with that of protein A. Starting at the C-terminus, there is a 
region responsible for cell wall anchorage, followed by six repeated 
elements predicted to adopt a conformation of linearly arranged do- 
mains. The capacity to bind serum albumin is mediated by the N- 
terminal half of the domain structure (Guss et al, 1986), whereas the 
site forlgG lies in the C-terminal half of the molecule (Akerstrdm et al, 
1987; Sjobring et al, 1988). Despite the similarity in overall organiza- 
tion of type III receptor and protein A, there is no homology between 
their IgG-binding regions, suggesting that they arose by convergent 
evolution. 

In order to localize the type III receptor site on Fc, two different 
approaches have been used. First, the ability of enzymatically derived 
fragments of IgG to interact with the receptor was assessed in direct 
binding and inhibition studies (Schroder et al, 1986; Stone et al, 
1989). The pFc 7 fragment (C^ dimer) of both human and rabbit IgCs 
did not bind to the receptor. Both human and rabbit F(ab')2 fragments 
failed to inhibit binding of radiolabeled Fc to group C and G strep- 
tococcal strains. Further, the C y 3- lacking rabbit Facb fragment showed 
little inhibitory ability. In the second approach, chemical modification 
of IgG implicated the involvement of IgG tyrosines in the interaction 
(Stone et al, 1989). These results suggest that type III receptor binds 
to an interaction site very similar to that of protein A, especially since 
the monovalent fragment D of protein A inhibits the binding of type III 
protein to IgG. Indeed, type III receptor was also shown to be capable 
of inhibiting the binding of IgM rheumatoid factors, which generally 
bind to the C^-G^ domain interface on IgG. However, a marked 
difference between protein A and type III receptor is that the former is 
sensitive to the His 435 -> Arg interchange in the IgG 3 molecules of 
most Caucasians, but the latter is not. This suggests that this residue, 
clearly central in protein A binding, is on the periphery of the type III 
receptor site. Subtle differences in the recognition processes of the two 
receptors may also explain the differences in pH optimum for IgG 
binding to protein A (-pH 8) and type III receptor (pH 4-5) 
(Akerstrom and Bjorck, 1986). X-Ray crystallographic analysis of the 
complex of Fc and type III receptor (or a fragment of it) would perhaps 
now allow definite identification of the interaction on both ligand and 
receptor. - 
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D Type IV Fc Receptor 

Tvne IV Fc receptors are associated with some ^hemolytic bovine 
J u p G streptococci. They are capable of binding human IgCi \%G^ 
fnd IgG 4 and IgG from several other species, parhcu ^ly rabbu (Re^ 
Tal 1990; Raeder et al, 1991b). Sequence analysis of the C 7 2 and ^3 
domains of rabbit IgG and the human IgG isotypes reveals only three 
tl! which are conserved in rabbit IgG, human Igd and IgG* 
E norintman IgG, These are residues Leu 235, Gly 236 and Ala 
3» allTn the cJ f domain, the latter residue being located on the 
Lterio surface oi the domain. Hence, one might speculate that a 
?^l**n^ site may encompass residues 235 and 236 which, 
as mentioned earlier, form part of the lower hinge site for the high- 
affinity human Fc^RI. Attempts to inhibit the binding to one receptor 
with soluble fragments of the other might be an illuminating future 
experiment. 
E. Type V Fc Receptor 

This Rfth bacterial receptor for the Fc of IgG is found on certain 
strains of Streptococcus zooepidemicus. It has a binding specificity 
very similar to that of protein A, displaying affinity for human IgOi. 
IgG 2) and IgC 4 and pig, guinea pig, and rabbit IgGs^nly weak reac- 
tivity is reported with cow, sheep, goat, horse, rat, dog, and cat IgUs 
(Myhre and Kronvall, 1980). More recently, a subtype of this receptor 
has been described (Yarnall and Widders, 1990). The subtype protein 
has a molecular weight of -45,000 and similar specificity to the origi- 
nal type V receptor except that it has strong reactivity with cat and 
horse IgGs. This subtype V receptor may recognize a site on Fc distinct 
from the protein A site because protein A apparently stall bind to IgU 
after binding of subtype V receptor in blotting experiments. This is 
somewhat surprising considering the similar binding profiles ot the 
two receptors, especially with the human IgG subclasses. Further 
binding inhibition studies with native receptors might aid clarification 
of the situation. 
F. Type VI Receptor 

An IgG-binding protein of ~46 kDa, present on the surface of 
S. zooepidemicus strain S212, has been designated type VI Fc receptor 
(Reis et al, 1988). It has specificity for rabbit, pig, sheep, goat, and cow 
IgGs and weaker reactivity with human, mouse, and rat IgGs. This 
affinity for rat IgG is, however, the greatest displayed by any bacterial 
Fc receptor thus far described. An explanation of this binding profile in 
terms of IgG site localization is, with present knowledge, difficult. 
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VII. Conclusions 

Considerable advances have, been made in the past few years f n 
delineating the sites on antibodies interacting with effector molecules. 
Although far from complete, such information allows much to be done 
in terms of rational design of effector function. For instance, single 
point mutations can be used to eliminate Clq and Fc receptor (FcRl 
and FcRII) binding from an IgG molecule. More subtly, point muta- 
tions can be used to eliminate complement lysis while leaving Clq 
binding unaffected or to eliminate FcRI binding with small effects on 
FcRII. In the opposite experiment, a single point mutation can be used 
to convert an IgG displaying no measurable affinity for FcRI into one 
with fully functional affinity. Generally, the introduction of function to 
an inactive antibody is likely to be more demanding of the level of our 
understanding, and here we have further to go. 

A recurring theme in this review is that binding of the appropriate 
antibody isotype to antigen is a necessary but not sufficient criterion 
for effector triggering. Examples abound of antibodies, for instance, 
that bind Clq but do not sustain a later step in the complement cas- 
cade. Similarly, there are antibodies that link effector and target cells 
effectively but do not lead to target damage. There would appear to be 
extra requirements associated with the antigen. Further, this does not 
appear to be simply a question of antigen density. One explanation 
might be that certain antigens are able to trigger allosteric changes 
whereas others are not. We have discussed why we think this unlikely. 
The explanation we favor is that antibodies linking arrays of antigen 
and effector molecules have preferred arrangements for optimal effec- 
tor function. Some antigens, or their local environment, would pre- 
clude the formation of such arrangements and so they would bind 
antibody but be unable to activate the effector function. This hypothe- 
sis could accommodate the independent sensitivities of many of the 
steps of the complement cascade. For instance, a particular antibody 
arrangement could be sufficient for Clq binding and CI activation but 
inappropriate for C4 activation. 

Accepting for a moment the notion of preferred arrangements, how 
might they look? We have described some circumstantial reasons for 
favoring the formation of IgG hexamers involving dislocated IgG mol- 
ecules with Fc-Fc interactions, but have no hard data to support this as 
yet. Certainly IgM appears to function best with respect to comple- 
ment activation in a "preferred arrangement" (hexamer) with Fc-Fc 
interactions. Dislocation of antibody molecules is a feature that is 
difficult not to embrace given the localization of Fc receptor binding 
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regions to the middle part of the antibody IgG and IgE molecules. In 
simple terms it is difficult to envisage the topology of the bridging of 
two cells by antibody without invoking movement of Fc out of the 
plane of the two Fab arms. The available physical data support the 
notion that such movements should be allowed. Indeed, flexibility of the 
antibody molecule could be useful in a number of ways when linking 
antigen and effector. For instance, dislocation and Fab arm opening 
will both serve to bring effector molecules closer to the antigenic 
surface. Thus, for example, the surface of an effector cell will be 
brought closer to that of the target cell (Fig. 16), facilitating target cell 
destruction, and the generation of activated C4b and C3b will occur 
closer to the target cell membrane, facilitating complement lysis. 
Again, flexibility is expected to be advantageous given the diversity of 
antigens that must be linked to the common effector systems. 

Clearly, much of this discussion is conjecture. One would like to 
have experimental measurements on the interacting triumvirate of 
antigen, antibody, and effector molecule. As discussed earlier this is a 
tall order, but it is to be hoped that appropriate methodologies will be 
developed. 

An interesting feature of antibody effector function is the occurrence 
of the human IgG subclasses. Typically one wonders about the role of 
the IgG 2 and IgG 4 subclasses given their very poor reputations in this 
area. We would suggest the following propositions: IgGi and IgG3 
both appear to mediate effector functions, although the relative effi- 
cacy of the two may vary according to the conditions of effector trigger- 
ing. IgG3, with its long hinge, would seem to have an advantage in 
bringing antigen and effector closer together in that repulsive forces, 
e.g., between cell surfaces, would be minimized. IgGi, on the other 
hand, by bringing antigen and effector closer together, may facilitate 
target damage. It could be that, in vivo, the two function cooperatively. 
IgG 2 does activate complement under conditions of high epitope den- 
sity and this provides a "rationale" for the preponderance of IgG2 
anticarbohydrate antibodies (Michaelsen et ai, 1991) because carbo- 
hydrates are often presented at high density on microbial surfaces and 
at lower densities more ubiquitously. IgG 2 does not generally interact 
with Fc receptors except for one form of FcRII as discussed. IgG 4 does 
not appear to activate complement under any circumstances. It does 
interact with FcRI, albeit more weakly than Igd and IgG 3l but not 
with FcRII or FcRIII. It may be that there are situations, e.g., blocking 
of the function of certain viruses, wherein it is desirable to have anti- 
body binding without, e.g., cellular uptake. 

Many effector studies are now carried out with monoclonal antibod- 
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ies, although in vivo, of course, the response is polyclonal. Because the of 1 
effector systems tend to recognize arrays of antibodies that could differ 

depending on the composition of the antibodies involved, the distinc- Antlei 

tion may be important. An example is provided by the synergistic N l" 

effect of two monoclonal antibodies on complement activation de- moi 

scribed previously. As more human monoclonal antibodies become Ande: 

available (Burton, 1991; Burton et ai, 1991; Persson et aL, 1991) this is P hi 

an area which should be explored. It is our guess that we shall find that ~? 5 

antibodies work best as mixtures, both in terms of epitope specificity An e 
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